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APPLICATIONS OF TWO OSCtTLATORY FORMULAS 
By John L. Robeets 

INTEODUCTION 

The main purpose of this paper is to illustrate how Mr. Jenkins’ osculatory 
formulas (A) and (E) can be applied in a convenient manner. The first section 
of this paper will be little more than a summary of some of the formulas con- 
tained in the other three articles. The second section will contain the appli- 
cations. 

I. SOME MATHEMATICS OF THE FORMULAS 

The Woolhouse notation will in this paper be used to stand for the differences, 
of which represents the given values of a function. The general formulas are 

2/* = yo + x^ya -b ix(x - 1)5 + ^x(x - l){x - |)(7; (1) 

and 

= Uo + xtti + ^x{x ~ 1)B + ^xix - l)(a: - |)C. (2) 

The special formulas belonging to (2) are 

5 = 6 — , and C = Ci - fci , (A) 

where h and d are defined by b = f(&o + h) and by d = |(do + di); and 

B = b and (7 = 0. (B) 

The special formulas belonging to (1) are 

2/0 = Mo + fbo; 5 = ^ and C = 0; (C) 

and 

j/o = Mo ~ TO^o I B — b gd, and C — ci gcj . (D) 

Formula (C) is equivalent to Mr. Jenkins' formula (A). Also (D) is equivalent 
to his formula (B). 


* This paper presupposes a knowledge of three other articles. The first one by Mr. 
Wilmer A. Jenkins is entitled “Graduation Baaed on a Modification of Osculatory Inter- 
polation,” and is printed in the October 1927 issue of the Transactions of the Actuarial 
Society of America. The other two papers are mine. One of them is entitled "Some 
Practical Interpolation Formulas,” and is printed in the September 1935 issue of these 
Annals. The other one. entitled “A Family of Osculatory Formulas” is printed in the 
October 1935 issue of the Transactions. 
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II. applications of (c) and (d) 

First, there is the problem of selecting suitable examples to which (C) and (D) 
can be' applied. Secondly, we will then apply in a convenient manner the 
formulas to these examples, 

The problem of selecting suitable , examples will now be considered. ''The 
non-reproducing characteristic of'" formula (D) "raises the question of what 
will happen in the graduation of a series whose fourth differences arc all posi- 
tive, say. The answer is that the graduated series will lie everywhere below 
the observed points and that the observations will not be correctly represented 
by the interpolated series.” On the other hand, if we select a series whose 
fourth differences change frequently in sign, (D) because of its non-reproducing 
characteristic has valuable smoothing possibilities. In like manner, (C) may 
be valuable when the second differences change frequently in sign. Mr. Jenkins 
gives at quinquennial ages rates of mortality which were graphically determined 
from the puUished American Men Ultimate Experience. Since the fourth 
differences of these rates change frequently in sign, we will apply (D) to a few 
of these rates. So far as I know no suitable actuarial examples have been 
found to which (C) can be applied. However, there is the possibility that (C) 
might be valuable in some sciences. Since I do not know of any suitable real 
example to which (C) can be applied, we will apply it to a trivial series whose 
second differences change frequently in sign. 

We are now ready to apply in a convenient manner (C) and (D) to the 
examples selected in the preceding paragraph. 

First, we will apply (G), I have in my other article applied (B) in a con- 
venient manner. This method with little change can be applied to (C).' If 
it is desired to apply (C) at either end of the table where values of Ux are not 
available for the calculation of the second differences, it can be assumed they 
vanish. It is convenient if S and represent respectively the major differ- 
ences Aux and A^Ux in such a manner that they are arranged centrally in the 
working illustration. It is convenient if s and represent respectively the 
minor differences Spx and ■ The quantity can be computed by yo = 
Uo + J&o , and yi can be computed in like manner. Since we wish in the working 
illustration of (C) to interpolate four values between yo and 2 / 1 , the middle 
s = 8y A = .2Ayo , and = .04B = .02(6o Hr l>i)- We can by the use of the 
foregoing method apply (C) to suitable functions, whose given values can be 
represented by f(r). Then, it follows from the definition of Ux that f(r) — Ux . 
It might prevent confusion if it is stated that x and r are related to each other 
in such a way that we always interpolate between yo and 2 / 1 . We shall now 
apply (C) to the case when f(r) represents the trivial series shown at top of 
page 3. 

Finally, we will apply (D). Mr. Henderson has applied (A) in a very con- 
venient manner. His method with little change can be applied to (D).' If it 
is desired to apply (D) at either end of the table where values of Ux are not 
available for the calculation of the differences required, it can be assumed 
that the fourth differences that can not be competed vanish, and, the required 
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differences can be filled in consistently with that assumption. It is convenient 
if S, and represent respectively the major differences A'ia, , , and 

in such a manner that they are arranged centrally in the working illustra- 
tion. It is convenient if s, and represent the minor differences so that 
by definition s ^ = di/x j == si = and / = sV* • The first 

= .04(60 — ido). The last = b^y.% — . 04 (61 — |di). The quan- 
tity 2/0 can be computed by 2/0 = -- -^do , and yi can be computed in like 

manner. The middle s == by,^ = , 2 A?/o — s^ We are now in position to 
apply (D) to the quinquennial rates of mortality. 


Age 

Rate 

S 




72 

.07010 

.03808 




77 

.10818 

.04669 

.00861 

.01799 


82 

. 15487 

.07329 

.02660 

- .01946 

-.03745 , 

87 

.22816 

.08043 

.00714 

. 12572 

.14518 

92 

.30859 

.21329 

.13286 

.12572 

.00000 

97 

.52188 


.25858 


.00000 
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Age 

Vx 

5 

^2 

5^ 

82 

.15591 

12612 

.001314 


83 

,168522 

13527 

915 


84 

.182049 

.014043 

516 

- .000399 

85 

.196092 

14160 

117 


86 

.210252 

13878 

-.000282 


87 

.22413 

13460 

- .000682 


88 

.237590 

13977 

.000517 


89 

.251567 

.015693 

1716 

.001199 

90 1 

.267260 

18608 

2915 


91 

.285868 

22722 

4114 


92 

.30859 

28006 

.005314 


93 

.336596 

34326 

6320 


94 

.370922 

.041652 

7326 

,001006 

95 

.412574 

49984 

8332 


96 

. 462558 

59322 

9338 


97 

. 52188 


.010343 




SOME SIMPLE DEVELOPMENTS IN THE USE OF THE 
COEFFICIENT OF STABILITY 


By C. H. Forsyth 

Some time ago the writer proposed^ a coefficient of stability C, to be used 
to measure the stability of a statistical series, where that coefficient is defined 
by the relation 


where M denotes the arithmetic mean and o' the square of the dispersion of 
the terms of the series. It was proposed to regard series as unstable (Lexian) 
for which the value of the coefficient exceeded unity, and stable otherwise. 
The only essential way in which such a procedure differs in results from the 
traditional method is that it includes as stable those series for which the value 
of the coefficient lies between unity and q the probability of failure of the event 
under investigation — series which would be classed as unstable according to 
the traditional method. Stable series — according to either standard — are found 
so rarely in practice and therefore so many series are accepted as fairly stable 
which come anywhere near meeting the requirements that replacing q by unity 
as the line of demarcation affects the classification of no known series but 
adds to the effectiveness of the avowed purpose and use of the proposed coeffi- 
cient— to avoid the round-about work of computing values of probabilities. 
Another merit of the use of the coefficient is that it enables one to measure 
and therefore compare the stability of several, series — a feature which we shall 
illustrate later. 

In brief, such a coefficient provides a means of introducing the whole' Lexian 
theory into Federal publications such as those on vital statistics, since a com- 
parison of the values of the coefficient for, say different communities or countries, 
would be readily grasped by any reader, whereas the traditional method would 
prove too subtle and laborious, and allow no ready comparison of results. 

For purpose of orientation let us illustrate the situation by analyzing a simple 
series both ways— the traditional way and by the use of the coefficient of sta- 
bility. As an example, let us consider the death rates of white infants under 
one year of age for 1919 (considered on page 89 of the Handbook) for those 
states whose frequencies of births are comparable or which vary little from 


’ Journal of the American Statistical Association, June, 1932 . 
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their average of 47,830— where the number, of deaths for each state has been 
adjusted to this average as a base. 


Adjusted 
Deaths X 


Cal..,. 

3350 

Conn 

4700 

Ind 

3732. 

Kan 

3253 

Ky 

3686 

Minn 

3159 

N. Car 

3541 

Va 

3732 

Wis 

3780 


9)32933 


M == 3659 


X - 3059 

(A' - 30539)2 

-309 

95481 

1041 

1083681 

73 

5329 

-406 

164836 

27 

729 

-500 

250000 

-118 

13924 

73 

5329 

121 

14641 

1335-1333 

) 1633950 


181660 = ff' 

0- = 426 


The traditional method would be : 

The mean M = np = 3659 where n = 47,830. 

„ 3659 , 44171 

’ 47830 «l3b 

and cth =, np (7 = 3659 Qyggj j = 3378 

whence (t^ = 68,15 


which is the value of the dispersion we should expect if the basic probability 
were constant throughout. Bpt the value of the dispersion proves to be 
(T — VI 8 I 550 = 426, and the comparison of the values shows that the basic 
probability to be very variable therefore the series to be very unstable or 
Lexian. 

The computation of the value of the coefficient of stability is much more 
simple and direct 


^ ^ 181550 

M “ 3659 


49,6 


whose excess oyer unity also clearly indicates the instability of the series. 

Since proposing the coefficient of stability the writer has been impressed by 
the overwhelming proportion of existing series (such as birth rates, various kinds 
of death rates, etc>) which employ arbitrary bases (such as ^^per thousand/^ 
'*per ten thousand,” etc.) usually without mention of the actual base. It is 
obvious, of course, that such rates, or occurrences per arbitrary base, say 5, 
can first be adjusted to give occurrences per actual base, say B (assuming that 



USE OF COEFFICIENT OF STABILITY 


7 


base 5* can be determined) but the work can evidently be performed much 
easier. For, since the original scries (per arbitrary base b) Zi , X 2 y ^ ‘ • Xn 

would become, on adjustment, ‘ the mean would become 

000 

and the square of the dispersion ) whence the formula for the coeffi- 

cient of stability would become 


a = 


-1 i 

M' b 


( 2 ) 


As an example, let us consider the general death rates, per 10,000, of New 
Zealand for the years 1921 -SO. 



X 

X -86 

0^ 

00 

1 

1921 

87 

1 

1 

1922 

88 

2 

4 

1923 

90 

4 

16 

1924 

83' 

-3 

9 

1925 

83 

-3 

9 

1926 

87 

1 

1 

1927 

85 

-1 

1 

1928 

85 

-1 

1 

1929 

88 

2 

4 

1930 

86 

0 

0 


10)862 

M = 86.2 

10-8 

)46 

4.6 


This example illustrates the danger of using the coefficient of stability unless 
the series consists of actual occurrences or unless the actual base is given due 
consideration. Without due consideration of the actual base (here the popula- 
tion of New Zealand) one might easily fall into the error of regarding the value 
of the coefficient of stability as 4.6/86.2 and, therefore, the series as very 
stable. But the population of New Zealand is about a million and a half and, 
therefore the true value of the coefficient of stability is 

^ ^ 1, 500,000 ^ n . 

' 86.2 10,000 


* Strictly speaking, this actual base B should be constant throughout the series; other- 
wise the suceeasive numbers of occurrences— the terms of the series— would not be com- 
parable. Where, however, the base B varies little from term to term — as usually happens 
even in the best of series, such as a series of some kind of rates of the same community 
over a short interval — the variation can be ignored, in which ca^c base B (to which the 
terms of the series are adjusted) usually means the arithmetic mean of the different bases. 
In the first treated above, the investigation was limited to certain states in an effort to 
comply with the rule just mentioned but the example is a poor one since the variations 
are still dangerously too largo. The situation is saved by the conclusive results. 
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which shows the series to be unstable. However, before we condemn New 
Zealand’s death rates too severely, let us compare her record with those of 
other important countries, including our own, for the same period. 


General Death Rates (per 10,000) 

M 


New Zealand 86,2 

Australia 94.3 

Sweden 120.4 

Scotland 137.3 

Austria 151.1 

United States 118.0 

England-Wales 121.3 

France 170,3 

Spain 193,7 

Italy 163.5 

Germany 125.4 

Japan 206.4 


G, 

8 

90 

96 

139 

536 

830 

1117 

1129 

2190 

2760 

6040 

6800 


These results show how extremely unstable most series of general death 
rates are and that the series for New Zealand, while unstable according to 
our strict criterion, enjoys quite an enviable position practically in a class by 
itself. Parenthetically, these results also illustrate fairly well the triviality, 
with respect to results, of replacing g by unity as the critical value of the coeffi- 
cient of stability, discussed at the beginning of this article. 

The values of the coefficient listed above would, of course, be reduced some- 
what in most cases if the trend of the series were first eliminated but the writer 
has gone though all this’ work and found it not worth while — that is, the series 
would still remain markedly unstable. , , 


Another development proves useful when, as frequently happens, the actual 
base B is unknown to a degree of accuracy desirable for use in formula (2). 

2 D 

Prom the inequality rij- — g 1 
M 0 


we obtain 



(3) 


which is to be used to show how small an actual base should be for the given 
series to be stable. As an example, let us consider the maternal mortality, 
per 10,000 live births, in the so-called expanding registration area of the United 
States. 
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Maternal Deaths in the United States (per 10,000 live births) (Expanding 

Registration Area) 


X X - 68 iX - 66)2 


1923 

67 

1 

1924 

66 

0 

1925 

65 

-1 

1926 

66 

0 

1927 

65 

-1 

1928 

69 

3 

1929 

70 

4 

1930 

67 

1 

1931 

66 

0 

1932 

64 

-2 


10)666 

66.5 

9-4 


1 

0 

1 

0 

' 1 
9 

16 

1 

0 

_4 

)_^ 

3.3 


Hence, by formula (3), B g ^ ( 10 , 000 ) or about 200 , 000 . The number 

( of live births varies so greatly that we should probably find it impossible to 
agree upon a satisfactory number^ to use as an actual base for such an '' ex- 
panding area^^ but we should all agree that it would be so much greater than 

200.000 that the instability of the series would be unquestioned. 

One must be careful in comparing the results of two or more investigations 
like the One just conducted. For example, the analogous result for Canada, 
for the same period yields B g 113,000 and we might conclude, too hastily, 
that the United States series is more stable (or less unstable) whereas any 
knowledge whatever of the numbers of live births of the two countries would 
show that Canada comes much closer to fulfilling her requirement than the 
United States and that the palm must go to Canada. For one thing, Canada 
has about the population of New York city and New York city has about 

100.000 live births annually. In any case, cZose decisions in matters of this 
kind would be difficult without sufficient information in regard to actual bases. 

There is still another situation which is interesting but of much less impor- 
tance because of the rarity of its occurrence. , It will be recalled that the coeffi- 
cient of stability was devised mainly to avoid the use and computation of 
probabilities and that the only difference between the results by the traditional 
method and by the use of the coefficient of stability lies in the trivial replace- 
ment of the critical value q by unity. In the traditional method of analysis, 
but by conaparing the value of the coefficient of stability with 3 , the coefficient 
is evidently always, strictly speaking, a function of the actual base B. In 
other words, there is no statistical series, however stable it may seem — except 


2 It was in the neighborhood of two million in 1932. 
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for the trivial case when all the terms of the series are exactly the same but 
what would be unstable if the base were small enough. It is possible to formu- 
late the limit once for all below which the given (otherwise seemingly stable) 
series would prove unstable. 

If, in the relation a S npq (tor stability) we replace p by M/n, g by 1 — Mjn 
and then n by B, we obtain 


, g or ^ 


<M- 


whence, finally 



(4) 


where the transference of the term M — from one side to the other should 
cause no apprehension since, by hypothesis, < M and ikf — is therefore 
always positive. We propose to employ formula (4) in those rare cases where 
the value of the coefficient of stability of actual occurrences — but without 
reference to an actual base — ^is less than unity — that is, where the given series 
proves to be stable according to the method proposed by the writer — and 
determine the upper limit of the values of the base B for which the series would 
be unstable according to the traditional method of analysis. As an illustra- 
tion, let us consider the familiar series of annual football fatalities in this country 
for the period 1906-1930* (omitting the years when no records were kept). 


Football Fatalities 


1906 

11 

1917 

12 

1907 

11 

1921 

12 

1908 

13 

1923 

18. 

1909 

12 

1925 

20 

1911 

11 

1926 

9 

1912 

13 

1927 

17 

1913 

5 

1928 

' 18 

1914 

13 

1929 

12 

1915 

15 

1930 

13 


It is easily verified that which is clearly less than unity; whence 

the series clearly seems stable. Applying formula (4) 


B a 


13.055' 

13.055 - 11.942 


or 153 


which shows that the given series is stable as long as the total number of foot- 
ball jjlayers exceeds the number 153. A recent news item quoted an estimate 
of the number players participating in games of four hundred colleges as about 
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13,000 and over 600,000 including high schools and all. We can then definitely 
say that the series just considered is stable. Such a conclusion has no bearing, 
of course, upon what might happen if other terms were added to the series. 
It happens that adding the records for the next five years— 1931(33), 1932(32), 
1933(27), ‘1934(25), 1935(30)— would change the whole series to an unstable 
one with Cs = 56.9/16.6 = 3.4; but, obviously, the additional records belong 
to a new regime of collection. 



INTERHAL AND EXTERNAL MEANS ARISING FROM THE SCALING 
OF FREQUENCY FUNCTIONS 

By Edwaud L. Dodd 

The scaling^ of frequency functions has been discussed from the standpoint 
cf maximum likelihood. But the likelihood criterion to be satisfied sometimes 
leads to a minimum likelihood; and sometimes to neither a maximum nor a 
minimum. Scaling will be studied in this paper with reference to the likelihood 
actually secured, and also with reference to the character of means obtained, 
whether internal or external. 


SECTION 1. INTRODUCTION 


It is well known that a scale obtained in a curve-fitting process is sometimes 
a mean. Thus, with the normal function 


( 1 ) 


a 


if the scale a is to he obtained from measurements, Xi, x^, - • - , Xn, v/e com- 
monly accept the value 

( 2 ) 

that is, the root-mean square of the measurements. Here, the positive value 
of a is naturally taken. It is called the standard deviation, and thought of as 
an appropriate new unit of measure. , 

But even with the x’s ail negative, and the a taken positive, 0. Chisini“ con- 
sidered it proper to regard a as a mean of the a:’s, albeit an external mean. 
From Chisini’s viewpoint, this o whether regarded as positive or negative is 
primarily a solution of 


(3) 3!; d" Xj -p -f a:^ = 0^ -(-... -p ct*. 

In this sum of squares, the single number a may be subditutei for each of the 
X s. Perhaps this kind of mean should be called a suhsUiutive mean to dis- 
tinguish it from the means of general analysis which are always internal. 


* Fisher, R. A,, “On the jnathernatical foundation of theoretical statistics,” Philo- 
sophical Transactions of the Eoyal Society of London, Series A, Vol. 222, 309-S68, (1921). 
See p. 338. . ! i \ / 


’ Chisini 
(1929). 


, 0., Sul concetto di media,” Peiiodioo di raatetnatlco, Series 4, Vol. 9, 10&-116 
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The normal function is a particular case of a more general function: 

(4) Constant ^{t) = t = x/a. 

The likelihood method to find the scale a for this function leads to power means, 
including tile arithmetic mean, the root-mean-square, root-mean-cube, etc., for 
p = l,2f 3, etc. 

The word scale will be used only for a positive number, — which then may be 
regarded as a unit of measurement. 

Tor measurements, Xij x^, • ■ • j Xn Chisini regarded M as a mean, relative 
to a function G, provided 

(5) G(xuX2, = G{M,M, ... 

If a solution of this equation is 

(6) M = F(xi, xs, . . . , Xn),. 

and c is a possible value for the x's, it follows at once that 

(7) F(c, c, - ■ ■ , c) = c, 

or at least one value of this P is c. Conversely, if (7) is satisfied, it is but a 
change of notation to replace c in (7) by M, and to combine this with (6) to 
obtain 

( 8 ) F{xi, 0 ^ 2 , , Xn) = FiM, • • • , M). 

Hence, this F which in (6) gives explicit form to the implicit M found in (5) 
may also be thought of as a mean-forming function, such as G in (5). Briefly, 
F is a particular G, Thus F{xif ^2, » • • a:n) is a mean of a:i, 2:2, • • • , if F 
is so constructed that (7) is satisfied when the arguments are all equal. 

Inasmuch as a frequency function /(i) is non-negative, loge/(0 is real, — say 
(f>(t) plus constant. Following R, A. Fisher, it will be convenient to write 

(9) f{t) = " C = Constant 

With location m already determined, the will be thought of as measured, 
from m. And we set 

(10) t = x/a^ ii = Xi/a^ i ^ 1, 2, • • • , n. 

The ^^productive^^ probability — to yield Xi^ 0:2, ••• , Xn — is then 

(11) L = Ufiti) = C a-" 

This is proportional to the “likelihood” of a. Also — it may be noted in 
passing — the productive probability is also proportional to the a posteriori 
probability, if a constant a priori probability is postulated. The likelihood 
will here be taken as itself; and it will be designated by L , — in Fisher’s 


*Loc. Cit., Fisher, p. 310. 
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notation, L = log H. Of course, H and log n take maximum values simul- 
taneously, if at all. From (11) it follows’ that 

(12) ' -a.0 log L/da = n -1- 2t,^)>'(ti) == l{ii4>'{U) + Ij- 
The equation 

(13) 2t,<)>'(ti) -h n = 0 • (f = 1, 2, . . . , n) 

will be called the likelihood condition, whether this leads to maximum likeli- 
hood, to minimum likelihood, or to neither, A second dtfferention^ leads to 

(14) a.0' logL/0o' = -n = - 1). 

When negative, this indicates a maximum likelihood; when positive, a minimum 
likelihood for the a obtained from (13). ^ 

Preparatory to the theorems of the next section, just one more matter will 
be discussed. The unit for i is arbitrary; and it may be convenient to write, 
with k 7 ^ 0, 

(15) = <i>(fcu) = $(«), t = ku. 

Then 

(16) = u^'{u). 

Suppose, now, that a positive constant fc can be found such that h^'{k) = — 1. 
Then, with t = ku, as postulated, 

(17) 1 •$'(!) = 4'(^) = -1- 

Thus $'(1) = -1, — or as it will now be written (p'(l) = — 1, — is no more 
restrictive than the condition that some positive k exists such that k4>'(k) = — 1. 

SECTION 2, GENERAL THEOREMS- CONCERNING THE SCALE AS A MEAN 

Theorem I 

Given the frequency function 

,/(0 = Ga ' t = xfa, U = Xi/a, 0 = Constant. 

And suppose that 

( 19 ) = - 1 . 

Suppose, also, that for given xi, xj, ■ ■ • , x„, the likelihood condition (13), 
now written 

(^*1) (*i/«)4>'(x,7o) -f Tl = 0, ■ 


*LoO, Cit,, Pisher, p. 338, 
*Loo, Cit., Fisher, p, 339. 
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has 'a positive solution, 

(21) a = F{xi, xi, ... , x„). 

Then this a, the scale, is a mean. 

Proof. With each Xi = 0, (20) cannot be satisfied. 

But if, with c 0, we take each Xi = c, and at the same time set a = c, 
then, by (19), S — —n; and thus (20) which gives a implicitly is satisfied. 
The explicit a in (21) is therefore such a function F that (7) is satisfied. Hence, 
the scale a is a mean. 

Theorem II 

Given the frequency function 

(18) f{i) — CoT^ t = x/a, h — Xija, C = Constant. 

Suppose that 

(19) 0^(1) = - 1, 
and that 

(22) ■ 1 I < 1 if M 1 < I- 

Moreover, suppose that the likelihood condition (20) for measurements 
^ 2 , • • • , Xtv, has a positive solution a. Then 

(23) ■ a ^ Maximum j | . 

Or, suppose that, in place of (22), we have 

(24) I > 1 Ul > 1; 

and that Wify keeps the same sign, if | i [ > 1. Then 

(25) Minimum | | ^ a. 

Proof. Suppose, if possible, that a > Max \xi\. Then each \xi/a\ < 1, 
and by (22), | (xi/a)<t)'{Xi/a) | < 1. Then (20) is not satisfied, since | S ] < n. 
Thus the hypothesis is contradicted. 

Now (25) is satisfied ait once if any Xi = 0. But suppose, on the other hand, 
that Min \xi\ > 0; and, if possible, that a < Min \xi\. Then, by (24) et 
seq., since | Xi/a | > 1, it follows that | S | > n. And thus (20) is again con- 
tradicted. 

Theorem III 

Given the frequency function 

(18) f{i) = i = x/a, . U = Xi/a, C = Constant; 

and set 0(0 = ^0'(O + 1- Suppose that 

(26) lim 0(0 = a, lim 0(0 ,= /3, afi < 0. 
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And suppose that ^(0 is continuous when i 0. 

Then, for any sot of real numbers, Xi, Xi, ' • ■ , Xn, oi which none is zero, 
there exists a positive number a, as scale, such that the lilcelihoocl condition 

(20) -f n = 0 

is satisfied. 

The couchisioii is also valid, if in place of the limit (i, there is postulated 
(27) Urn rp{t) ^ ^ ff I oQ I ^ lim tpQ), 

1— (jHrO 0 

where b > 0, c > 0, and ip{l) is continuous for — b < i < 0 and for 0 < / < c. 
That ia, the new limits are to be infinite with sign opposite to that of a. 

Proof. The limits for f — ^ 0 and for j 1 1 — > <» are the same as the limits 
for a — ► 00 and a — ^ 0 4',— noting that i = x/a, x 7^ Q. Thu.s changes 
sign as a goes from 0+ to 00 * Hence, .since ^(/) is continuous, (20) is satisfied 
for some positive a. 

For the proof of the second part of the theorem, suppose that Xn > 0 and 
that aJn is the greatest Xi . Then with a > xjc^ but approaching a:„/c, \f'(Xn/o>) 
becomes infinite with sign oppo.sitc to that of a. Furthermore, in 2i/'(,T;,'/a), 
the positive »'a < Xn have a negligible effect; and thus lim S^(a:,/a), as 
a (Xfl/c) + 0, is infinite with sign opposite to that of a, when this sum 2 
is taken for the positive Likewise, if 3:1 < 0, and is tlie least X{ , lim 'S\p{x{/a), 

a.s a — » (— a!i/b) -f 0, is infinite with sign opposite to that of a, when this sum 
is taken for the negative x’a. If, now, the measurements happen to bo all 
positive, we think of a as approaching x„/c 0; and the continuity condition 
leads to an a which makes 2\p(xi/a) == 0. Likewise, if the measurements 
happen to be all negative, we use —xi/b + 0. If both positive and negative 
x's appear, we use the greater of the two ratios -xi/b and x„/c, 

SECTION 3. some FAIELV REGULAR FREQUENCY FUNCTIONS 

To illustrate the foregoing theorems in a somewhat general manner, consider 
tile measurements, xi, 3:2, ■ * , , Xn, and with t = x/a, it = xt/a, set up the 
function: 


(28) f{i) = Oa'* ( U r (1 + f h, 

whore, as before, C ia a suitably chosen constant. 

Suppose also that 


(29) 

p > -1, 5^0, 

r ^ 0, a ^ 0; 

and that »ther 



(3fl) 

r > 0, a > 0 or 

r & 0, 2^ > p 4- 1. 

J 

Then with <t(() 

- log/(t)> it follows that, when £ 0, 


(31) (*'(0 + 1 = (p + 1) - rsi- ( t f - 2qkh\l + iY)-‘. 


I 
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Now the condition — —1 would be satisfied if "^(k) = 0, where 

(32) ^(fc) = -H rf + (2g - p ~ l)k^ - (p + 1). 

But, under the conditions (29) and (30) ^(0) < 0, and ^^(®) > 0, Hence, 
there is a positive k for which ^(k) = 0. Then if h he assigned this value, 

(19) is satisfied; and by Theorem I, any scale a that the likelihood condition 

(20) may lead to is a mean, But, by Theorem III a scale a will actually exist 

— indeed, for any positive h that may be used in (29) ; since the limit of + 1 

is positive as i 0, and is negative as j i | -+ «> . 

Moreover, if in (29), the further conditipn —1 < p ^ 0 is introduced, (22) is 
satisfied. And, thus, a S Maximum | a;,- 1. Also, | | increases with [ i j. 

Hence, by (24) et seq., Minimum | a* [ ^ a. 

If in (28), we set q ^ 0, s =* 1, r > 0, and confine our attention to positive 
X and i, there is obtained the Pearson Type HI. Reference to (32) shows that 
« 0 if fc — (p -b l)/r. With this substitution, 

(33) fit) = & f C' ^ Constant. 

Since 4»^(1) = — !» any solution of the likelihood condition'is a mean. Here, 
with i > 0, Wij) = P — (p + 1)^1 and — 1 = — (p + !)■ From (14) 

we see that, with p d- 1 > 0, any mean obtained corresponds to maximum likeli' 
hood and the single maximum found is actually the largest value. Moreover, 
with the measurements, asi, JCa, • • • , all positive, a scale a will exist, — as 
noted in the general case (28). 

In passing, it may be noted that Type III appears® rather naturally in a 
form giving *^'(1) — " I at once, without any transformation. Here, then, a 
scale is a mean. 

Given the Pearson Type I in the form 

(34) f{i) S5 C<r^Q) + My{c ^ kt)^t t = x/a, & > 0, c > 0, | p? | > 0. 

If P + ? + 1 > 0, it is possible to find a positive h so that with = log/, 

^'(1) =5—1. In this case, any scale found by the likelihood condition is a 
mean. With k thus chosen, /(i) has essentially the same form as it would have 
if fc — 1. Hence for convenience, let us simply set /fc = 1 in the above equation. 
Then for — b < f < c, 

^{t) = + 1 ^ 1 + pt{h + i)“' “ ffKc - <)'"• 

Suppose now that p > 0 and ^ > 0. Then Theorem III may be applied; since 

lim ^(f) 1, as i ► 0; but lim —> — aci,asi— ►—bd- 0, oras^— — 0, 


^ Carver, H. C., Handbook of Mafchematical Statistics, Chap. VII, see p. 105, Line 4, 
noting that = y'/y. 
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Hence a scale a satisfying the likelihood condition exists, Moreover, the likeli- 
hood is at a maximum; since, with -I < i < c, 

tY(() - 1 = -pt\b + 0 "' - 0 ^' - 1 < 0 . 

This maximum is also the largest value for all values of a. 

If the Pearson Type IV is given in the form 

(35) m - Co'‘(l + kHT” t - x/a 

then if p > 1/2, it is possible to find a positive h vjhkh will make (1) = - 1 . 
In this case, any scale ^ is a mean. Moreover — for any fc ^ 0 — the limit of 
+ 1 is 1 for i 0 and is 1 - 2p for i Hence, by Theorem III, 

if p > 1/2, as above, then a scale a exists satisfying the likelihood condition (20) . 


SECTION 4. FREQUENCY FUNCTIONS ’WITH CERTAIN rECULlARlTlEB 

The theorems of section 2 give sufficient conditions, which in, some cases 
may not be necesaary. Nevertheless, by violating certain hypotheses, particu- 
lar functions may be set up which exhibit various peculiarities. 

For the Pearson Types, the differential equation is 



__ ,//A _ ffo + 

y(i) + 


t = xja. 


The determination of a positive scale a by the Fisher likelihood process is 
impossible here, in case ao = 0, ai > 0, 6o + b\i + > 0. For in this case 

^ 0; and thus (20) cannot be satisfied. The U-shaped Type H curves 
are in this class. Likewise, if Oo 0, ati ~ 0, and bo + bji + > 0, — For 

example, with 5a > 0, 5® < 4bo52 , — and the measurements all happen to have 
the same sign as a^, such scaling is impossible. 

For the purpose of constructing peculiar functions we may take c > 0 and 
require that the measurements X{ be either —c or c^with at least one — c and at 
least one c— and that <^(i) be an even function. Then (i>{-'c) = ^(c) and (11) 
becomes 

(37) L « ICa”' 

The likelihood condition (13) reduces to 


(38) 0 = <1,(1) = + 1 = (c/a)*'(c/a) + 1, 

with the tight member eu even fvmetion of c/a. And Iiom (14), a maximum 
likelihood is indicated when 


(39) (o/o)®i^"(c/a) - l.< 0, 

with the left member Ukewiae an .even tunoUon. A minimum likelihood is 
indicated if the left mettt.heT ia positive. 
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Let us apply this to the case where 

(40) m - (-2/3) log (1 - 3 1 1 1); - 2 1 1 1(1 - 3 ) i l)-^ 

The likelihood condition (38) is satisfied only when t = d=l. Also <^^(1) = — 1. 
Thus the only means are the internal means ±c; and the only scale conformable 
to (38) is 0 = c. But this has minimum likelihood; since 1 — 1 = ^ > 0. 

For positive t, this function (40) is a Pearson Type. 

Consider next a function of the form (28),^with p = g — -0.5, 

however, — for which (31) become^ 

(41) (4i'(0 + 1 = -1/i - iV4 + <7(1 + <’) ” -(1 ~ <7/4(1 + i7. 

whence <^'(1) =* ~1,4/>^'(1) = +1, <^'''(1) — —3. Here the likelihood condi- 
tion (38) has but a single absolute solution 1 1 1 = 1, leading to the single scale 
a = c, and to the two internal means, ±c. But, in this case 1 • <#>"(!) — 1 = 0, 
so that 9^ log L/da^ 0. Moreover, for i = 1, 9® log L/da^ = oT^ 0, Thus, 
the only scale obtained by the likelihood method (38) — viz., a = c— has a 
likelihood which is neither at a maximum nor at a minimum. 

Another anomalous function is that given by 

(42) 0(0 « - 2.m\ i = ±c/a. 

The likelihood condition (38) leads to 

0(0 = (1 - t®)(l - ii^) = 0, 

The only solutions are t = ±1, giving internal means ±c; and i = =fc 1/2, giving 
external means ±2c. And from (39) et seq., it can be shown that the internal 
mean and scale, a = c has minimum likelihood, while the external mean and 
scale, a = 2c, has maximum likelihood. 

But it will be noted that a maximum value for a vicinity does not always 
signify a largest value for the entire possible range. Indeed, for the function 

(42) , a = 2c has maximum likelihood without having the largest likelihood. 
To avoid such an anomaly, a necessary condition is that as [ ^ | — > c=o , 
0(0 — > — « ; as seen by taking the logarithm of L in (37), noting that as a -» 0, 
(-log a) ^ +CO. 

Finally avoiding the anomaly just mentioned, let us set up a frequency 
function, using the 0(0 in (38), and writing 

0(0 = 1 + «0'(O (1 “ 2t®)(l - 0(1 - 

From this it follows readily that ^ 

(43) , 0(0 =K - 1.95t' -k - 0.30 K = Constant. 

This, with U = ic/a, leads to an internal mean or scale a = c with minimum 
likelihood, a nearby scale a == c 'W'Hh maximum likelihood— differing 

indeed only slightly from the minimum just mentioned— and another scale 
0 = having maximum likelihood, and this likelihood is indeed greater 


i 
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than that for any other positive value of a. The external moan o - c\/2 
in this case has the largest likelihood. This may be checked by the uso of 
the logarithm of L as it appears in (37), in ivhich the important part is 
(/i(c/a) ^ log ft. 

In passing it may be noted that if if (0 has the form \lf{t) = (1 t)B (/), with 
H(l) and ii = Xijo.; then any solution a of the likelihood condition 
^(i) = 0 is a meaHj^by Theorem 1. 

SUCTION 5. BUMMAHy 

When the E. A, Fisher likelihood method is used to find an "optimum" scale 
for frequency functions, it sometimes happens that this scale is a well known 
mean or at least is a sMkim mean-See Equation (6), Or a simple trans- 
formation (16) may often put the frequency function into such a form. Con- 
ditions are given under which a scale will be a mean. Under further condi- 
tions this mean will be internal-'at least as regards absolute values. Finally, 
under certain conditions, a scale will exist, 

But for certain functions not satisfying these conditions, anomalies appear. 
The scale given hy the usual likelihood condition may be a scale with a minimum 
likelihood. Sometimes the likelihood will be at neither a maximum nor a 
minimum. In certain simple cases, no scale exists. Furthermore, It may 
happen that the scales which are internal means have minimum likoliliood and 
those that are external means have maximum likelihood. Among Pearson 
Types are found both anomalous functions and functions which would bo 
regarded as regular as regards maximum likelihood, 

In this problem of scaling, likelihood is proportional to ft foslerioTi probability 
with the ft ‘prim probability taken as constant, 



MOMENTS OF ANY RATIONAL INTEGRAL ISOBARIC SAMPLE 

MOMENT FUNCTION 

By Paul S. Dwtbh 
Introduction 

j 

The problem of moments of momenta has been investigated by a number of 
authors. The assumption of an infinite universe (or that of a finite universe 
with replacements) permits the application of the "algebraic’’ method, the 
method of semi-invariants as introduced by Thiele (1) and developed by C. C. 
Craig (2) and the combinatorial analysis method introduced by R. A. Fisher (3) 
and used by N. St. Georgescu (4), A combinatorial analysis method has the 
particular advantage that it enables one to compute separate terms of a given 
formula. 

The formulae for moments of moments have been simplified through the 
use of new moment functions. Thiele introduced the half-invariant (1) which 
resulted in considerable condensation. More recently Prof. R. A. Fisher (3) 
has introduced the sample function k whose expected value is a half invariant. 
The most compact formulization presented thus far is his formulation of the 
half invariants of the sample fc, in terms of the half invariants of the universe. 
This very compactness, however, makes it difficult to compare results with 
those expressed in the more conventional sample functions. Dr. Wishart has 
written a paper (7) in which he shows, among other things, how the Fisher results 
can be translated to the more conventional (Craig) results and vice versa, but 
such translation is' in general no simple matter. It appears that the Fisher 
results are not immediately useful to the statistician who desires the formulae 
to be expressed in terms of the usual sample moment function. On the other 
hand the Fisher formulization is a remarkable discovery toward that harmony 
which must be naturally inherent in the field of moments of moments. Soper 
(6, 111) expressed the general situation when he wrote, "If the terrifying over- 
growth of algebraic formulation accompanying this branch of statistical inquiry 
is destined to have a chief utility in induction and going back to causes, then 
perhaps Dr. Fisher’s way of estimating a sample will prove to be most fertile, 
but if it is to be applied to problems of deduction, say to problems of suc- 
cessive eventuation such as propagation, then Mr. Craig's plain moments seem 
to have a firmer hold on the exigencies of time." 

It would appear then that the Fisher formulae and the Craig formulae are 
both needed. Georgescu (4) showed a partial connection between them in 
applying to the m functions a combinatory analysis somewhat similar to that 
applied by R. A, Fisher to the k function. It is the purpose of the present 

21 
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paper to work out a combinatorial procedure for a more general sample function 
80 tlmt either the Fisher or Georgescu combinatorial results come out as special 
cases, In making such a generalization no limitation is placed on the sample 
tanction except that it bo rational integral and that all terms are of the same 
weight, Tims the results are applicable to lUr, mr -j- kr, mrkr, etc, as well 

as to rftr and K although they arc not applicable to ■%/ or In this way 

the important formulae for the nroments of a new sample moment function 
will be available by simple substitution as soon as any such new function is 
defined by a rational integral isobaric expansion of power sums. 

It is thus the purpose of this paper to determine the moments of a general 
moment function of the sample. This is done by keeping the irLultipUers of 
the various partitions of power sums indefinite until all manipulation is complete. 
It is then possible to assign the defiirite values of these multipliers which are 
associated with the desired sample function and to obtain the moment of 
the desired moment function in this way. Thus the Fisher result k( 42), and 
the Craig result V 2 ) are special cases of the new result Xnifi, /g). It 
is obvious that it is not possible to carry the results using these general moment 
functions as far as Fisher and Wishart (3), (6), (7), have carried the results of 
the decidedly advantageous (from the standpoint of simplicity of result) k func- 
tion and yet it ia surprising to find the simplicity which can be obtained in 
the general case. Incidentally the intToduction of the more general symbols 
clarifies the successive steps of the partition analysis which are somewhat con- 
fusing in any specific case because of the inscTtion of the value of the coeffi- 
cients of the power sums in which the sample moment function is expressed. 

This paper ig divided Into three parts. The first part includes the necessary 
definitions, the basic formulae, and the general development of the algebraic 
method. In order to facilitate the algebraic work there is inserted a table giving 
the expected values of all ppssible partition products of power sums whose 
weight 58, The second part deals with the difierent sample functions which 
might he used. The third part gives a list of the various partition foi'mulae, 
of weight 58, which contain no unit parts and shows how these can be used in 
writing the chief variations of the formulae for moments of momenta. 

Part I 

h 

1. General Moment Fuactions. Different moment functions have been de- 
fined in Various ways, but all moment functions have in common the property 
that they may be expressed in terms of the power sums, It appears sensible 
to Use this expression in terms of power sums as the working algebraic definition 
of tnoment functions. For example the function fca, which is defined by R. A. 
Fiaher to be that function of the sample whose expected value Is the third 
cumulant (half invariant) is to be given the working definition of 

i. = n® 3(2) (1) ^ 2(1) (1) (1) 

. (n - 1) (m - 2) (n - 1) (n - 2) n(n - 1) (n - 2) 
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where the numerical expressions in parentheses indicate power sums of the 
sample. 

Every term in the definition of a sample function has a “weight” which is 
equal to the sum of the power sums whose product is indicated by the term. 
Thus the weight of each of the terms of h is 3. If all the terms of a given 
moment function have the same weight, the function is called isobnric and 
the weight of the function is equal to the weight of each term. Thus is an 
isobaric moment function and its weight is 3. Since all the functions so far 
proposed are isobaric we limit this generalization of moment functions to iso- 
baric moment functions although it is possible that a more complex analysis 
could be worked out for non-isobaric functions. 

Generality demands the inclusion of every possible partition product of 
power suras. Such generality can be obtained by writing 

h - al(l) 

/a = 02(2) 4* 011,(1)* 

/a = Oa(3) + 02 i( 2)(1) -f- Oiu(l)^ 

« 04(4) + aai(3)(l) 4" a2j(2)* 4* 02u(2)(l)* 4" ou(l)* 

and in general 

/r = 2 oilipppT* Cpi)'‘ • ‘ ■ W* 

where (pa)'* ■ • ■ (p^^* indicates any partition product of power sums, 
ttpfi . .. is its coefficient and the summation is taken for every possible parti- 
tion. The number of parts of the partition is p — Sir. It may be assumed, 
without loss of generality, that the partition is ordered, i.e. 

pi ^ pz ^ Pa ^ ^ P* . 


A natural numerical coefficient of each term is the number of ways the r 
units can be collected to form the given partition. This value is given by 


r 


■I 


sP'^ Pa ^ * ■ • V»f (paO'* • • • (p* O'* iTi I TTzl ' • • ir« I 


If we sot 


r 


®/»T‘ p*' 


pv p;*/ 


Op'l ... 


the definition of fr becomes 


/r = 2 ^ j Ojii‘ pT* (Pi) 

\Pi • * ■ Pi / 


(p.)* 


In the present paper the capital letters are used to represent the corresponding 
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functions of the universe as defined hy the corresponding power sums of the 
univeTSe, Thus 



represents the corresponding function of the universe. In the case of the 
moment about the mean and the semi-invariant the Greek letters p and X have 
been used to represent the correspondiug function of the universe. In the 
case of functions whose notation is quite widely established, it is preferable to 
use the conventional notation, but in introducing new functions it appears 
wise to use the relationship between small and capital letters since the corre- 
spondence between the English and Greek alphabets is not exactly one to one. 
It should be particularly noticed that this notation does not agree with a pre- 
viously accepted scheme of using the small English letter to indicate the {unction 
whose expected value is indicated by the corresponding Greek letter. In the 
present paper it is not the expected value property which serves os the basis 
of notation hut rather the definition of the function in terms of the partition 
products of power sums, 

2. The Working Definition of Moments About a Fixed Point. The sample 
functions defined by 


tnj =s 



( 2 ) 


Wa s= 

n 



' n 


are obtained from/r by placing 

^ when 5 = 1, Ti ^ 1, and p\ ^ r, 

• V 

0 in all other oases. , 



The Greek is used to indicate the corresponding function of the universe. 

3. The Working Definition of Moments About the Mean. The moments 
about the mean are defined by 


m[ = 

n 




^ ( 2 ) _ ( 1 ) ( 1 ) 


n 


n 


3 » 


mi » 


(3)_3^ ^ 


Wi 


_ (4) _ 4(3) (1) , 6(2) (1)’ 

,2 + ■ 


3(1)' 


n 




n* 
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and in general rrir is obtained from/r by placing 

f 1 . 

- if s s= 1 =s 1, and pi = r, 

n r I r 

(- 1 )'* . 

— if pi > 1, Ti 1, s 2, and pj = 1. 

It pi = 1, s = 1. and iri = f. 

. 0 in all other cases. 

The corresponding moments of the universe are indicated by the conventional 
For conciseness moments about the mean are referred to as “momenta*” 

4. The Working Definition of the Half Invariants. The half invariant 
moment functions of Thiele, as applied to the sample power sums are [see C. C. 
Craig (2, 7-10) and Frisch (12, 20-21)]. 

? - ^2) „ OHl) / _ (3) ^ 3(2) (1) W’ 

* n ’ % 'n? ' ^ n a* a® 

, (4) _ 4(3) (1) _ 3^' 12(2) (1)^ _ 

^ “ n a* a® n* 


and in general 






so that 


no 


Oj,ll .. 


vPi' Vi'i 


(p.)’' (p,)*’ • • ■ ip.)" 


IP. — 

pM* ^ 




no 


The corresponding moments of the universe are indicated, after Thiele (1) 
and Craig (2), by X, R. A, Fisher (3) used k while Georgescu (4) used s. 

In the present paper these functions are referred to as “Thiele momenta.” 


6. The fc Functions of R. A. Fisher, The fc statistics of R. A. Fisher are 
defined in terms of the sample power sums by 


*: = w, fe = 

n 


( 2 ) 


(D* 


» — 1 «(» — 1)' 


n(3) _ 3(2) (1) ^ 2^’ 

“ (« - 1) (ft (ft - 1) (ft - 2) »w 

, B(n + 1 ) (4) 4(b + 1) Is) (1) _ 3(2)“ _ 12(2) (1)’ _ 6^ 

^ " in - 1)W" (ft - 1)™ (ft - 2)<» (ft - I)"’ »«> ■ 
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These values and values for hi and fc* are given by K.. A. Fisher (3, 203-4) 
while algebraic methods of attaining them are presented in sections 16j 17. 
They are referred to as Fisher moments. The corresponding functions of the 
uni verse, if used, would be represented by Kf. 


6. The h Function. Just as Fisher introduced a sample function whose 
expected value is a Thiele moment of the universe, so it is possible to introduce 
a function whose expected value is a moment of the universe. Such a function 
is defined by 


n 


n 


1 n{n — 1)' 


ha = 


niS) 


3(2) (1) 


{n - i)(n - 2) (n- l)(n - 2) 


+ 


2(1)= 


n 


m 


, in^ - 2n + 3) (4) Ain^ - 2n + 3) (3) (D 3(2n - 3) (2)' 
"" (ti - 1)(« 


n 


( 4 ) 


+ 


6(2) (1)^ 3(1)* 

(n - l)w n«> ■ 


Methods of obtaining the expansion of this function in terms of power sums 
■ are presented in section 18. The corresponding function of the universe, if it 
were used, would be represented by Hr. 


7. Other Moment Functions, It is possible to obtain an indefinite number of 
moment functions. For example one might define a function of weight 2 whose 
■variance equals fUj (or >ia), It is possible by the methods of this paper to 
find expressions for such moments. 

For reference purposes Table I is provided showing the values of a for each 
partition of weight <6 for the functions m', m, I, h, fc. The values of 

( '■ ) - 

are also inserted, in the left' hand column, so that it is possible to read from the 
table the values for / = mj, ntr, Ir, h when r < 6. 

8, Products of / Functions. The product of two or more isobario functions 
is also isobario and of weight equal to the sum of the weights of the functions. 
Thus 

Sifi = M) + auCl)(l)Ml)] = aiai(2)(l) + ai,ai(l)* 

/*/[ = a,aK2)Cl)* + auatd)*. 

In multiplying by any. term of fr( is of weight and when It la multi- 
plied by any term of weight rs, the result is a term of weight n + rj . 
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TABLE I 

Coejfieienis of Products of Power Sums in the Expansion of Different Moment 

Functions 


Nuiticri- 








cal 
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coefli- 
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Wr 


hr 

cient 
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Oi 
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1 

1 

1 

1 


■ 

n 

n 

n 

n 

n 

1 


1 

1 

1 

■| 

1 

1 


n 

• n 

n 

n — 1 

71—1 

1 

an 

0 

-1 

-1 

-1 

-1 







1 

as 

1 

1 

1 1 

n 

n 

n 

n 

n 

(n - 1)0J 

{n - 1)(3) • 

3 

(hi 

0 

-1 

-1 

-1 ‘ 

-1 




(n - l)l» 

(b - 1)(® 

1 

am 

0 

2 

2 

2 ' 

2 

w® 

70 


7i(») 

1 

04 

B 

1 

1 

w(w + 1) 

71^-271 + 3 

n 

n 

n 

(» - 1)«> 

(71 - 1)(« 

4 

aai 

0 

-1 

-1 

(n + 1) 

(n ^ 1)(« 

7l“ - 271 + 3 

7l(« 

3 


0 

0 

-1 

-1 

2n — 3 

022 





(71 - 2)<» 

TlO) 

6 

0211 

0 

1 

2 

2 

i 


n.® 

(71 - 1)(« 

(71 - 1)»> 



0 

-3 

-6 

-6 

-3 

1 

Oim 

n* 




1 
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1 

1 

n\n + 6) 

n{n^ — 6?i + 10) 
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n 

{71 ~ 1)<« 

1—1 

1 

5 

041 
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-1 

70 

n{n rf- 5) 

(n - 1)»> 

7i' - 5n + 10 
in - 1)«*) 
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TABLE l~-Concluded 


Numeri- 

cal 

cqeEB' 

cient 

a 

f 

tnr 

Ir 

kr 

hr 

10 

ttau 

m 

1 

71® 


2(71 + 2) 

(ft ^ 1)(^J 

71^ — 4ft + 8 

ft(®> 

15 

Oszi 

0 



2(ft “ 1) 

(71 - 1)<<) 

(2ft - 4) 

+ ^( 5 ) 

10 

02111 

0 

-1 

-6 

6 

. 1 


n* 

U ~ i)i« 

(71 - i)w 

1 

ttimi 

0 

4 

,n® 

71® 

24 

ft<®> 

4 

71<®J 


R. A. Fisher [3, 207] used the product klh as an illustration of the algebraic 
method. The more general ftft gives 

jifi — [ttaCS) + 3a5i(2)(l) + aiu(l)*ft^j!(2) + aii(l)(l)] 

= a5a2(3)(3)(2) + aSaii(3)(3)(l}(l) + (2)^(1) 

+ [Ooiiaijiciii + 2a8Q4aiii](3)(2)(l)^ -f 9fl2iaa(2)^(l)* + 2a3aiiiaii(3)(l)^ 

' + [aaaiflmaz + 9a2ian](2)*(l)”' + [fiazittmaii + a2ain](2)(l)® *f aiuaii(l)® 

which reduces to the value as given by him when the values of a are substituted 
from Table I. 


9. The Expected Value of Any Partition Product. The expected values of 
partition products are well known and are indicated by 

■ Ei'pi) «Mpi 

®(pi)(p2) = ^Mpj+pj d" 'W'(w ~ l)apiapj 
^ipMvdiVi) - ^Ppi+Pi+Pi + - 1) Wp,+fiiiip3 4- Pvi-i-PtPpi + Mpi+PiMiJ 

+ n(n — l) {n - 2) ^PlHp3tip3 . 

and in general 


£(»)"(?.)'■ ■ • • (p.)" - 2 


■pVpi' ■■■ 

.5?' Ji' 


P'.- 

3? 


, ip'Mr ■ ■ ■ (( 4 .)’“ 


where t — xi ,+ Xb *f Xa + • • • + x* 




,3i“ q}‘ 


9! 


Xi 


indicates the 


I 
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number of 'ways in which the partition pj* •- pj* can be grouped to 
form the partition qP • ■ • 

The continued application of the result above leads to a large number of 
formulae, In order to make these results accessible I present in Table II the 
expected values of all partition products of weight ^8. The essence of the 


table is the evaluation of the expression 


'pv pr ' ’ • 

-Jfl rtXJ 

,qi 54 



The numbers 


at the top of each column indicate the subscripts of the ju's which must, of 
course, be multiplied by The entries on the extreme left are the numerical 
coefficients associated with each row. 


10. The Expected Values of the / Functions. With the use of Table II one 
is able to write expressions for the expected values of fr when r < 9. 

- aiJVjuI 

p 1(/2) ™ = (a? + an)ntii-^ ann{n - l))u(* 

= ^(/s) — (u3 + 3fl4i + ctm)nni + 3(c2i + Utii)w(n — 1 )p3Mi 
+ ainn(n — !)(« — 2)p[® etc. 

If the expected values of the / functions are expressed in terms of the moiaentB 
about the mean of the universe, these formulae become, since w * 0 

Pi(/i) - 0 

pi(/a) — (us + Uii)n^8 
pk/a) ~ d" d" o-iiOnpa 

— (®4 d" dflji + 3032 0U411 d" Ullll)WM4 

d" 3(a!t2 d" 2aaii -b flmi)w(w ~ l)/4a etc. 
These may be written more symbolically as 

p:(/i) = 0 

' Pi(/4) “ ftaW/ij 

nl(/a) = 6anm 

PxC/v “ "b 3524n’(n 1 )m 3 Otc. 

U. The Expected Value of Products of / Functions. The expected value of 
products of f functions may be similarly found. For example 

w(/2) “ S{ji) = £tffs(2) + iiii(l)Y = o|B(2)’ + 2ajOu®(2)(l)(l) + o?iE(l)‘. 





























































weight “ 0 
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TABLE II — Continue 

TV 1 »w| j ),^U> fi^t) 

6 I 51 I 42 I 33 Ull 32l| 2^ 31*1 2* 1*1 21< 1‘ 



weight - 7 



321* 2’ 1 I 31* 2* V 21' r- 
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TabJe II can now be used by indicating aj as a multiplier of 2aiaii ns a 
multiplier of £^(2)(1)(1) and au as a multiplier of (1)\ Then at once it is 
evident that 

— {(i\ "h SosAii + (^i)nfn -f- (fl2 + 2020]! -f- 3an)n(n — 1 )m2 
= (^2 + aii)^nfii “h [(a^ d- au)^ + 2a\i]n{n ^ l)jx4 
^ “h (ba -|- 2bii)w(7i — 

Similarly 

MiiC/sj/a) = bibiUta d" (ba&s + Shibz 662 i 6 ji)'rt(n ^ 1)/^3W 

^^(/a) = 6a'n/i& 4" (Qblt 4" 6b3b2i)'a(a — 1 )m 4M2 4“ (ba 4“ 9l)2i)‘n.(7i ” l)>ia 
4- (96li 4- 66;n)u(n - l)(u - 2)nl 

etc. 

I 

■where bs — Ca 4" 3<r2i 4“ ®iii# bgi = aji 4* aiui bm ~ am* The important 
special cases are obtained by assigning the proper values to the a's as given 
in Table I. Thus 

tizimz) = i [(n — 1)^4 4- — 2n ^ ^){n — D^z] 

72.® 

which agrees with the corrected result of ‘‘Student" in 1908 (8, 3) and Tchou- 
proff (10, 192). Similarly 

wis) = -^ [(n - 1)^ (n - 2)^8 4- (ti - 1) (w - 2) U - 5?^ + 10)/i5/i2] 
w 

juj(wa) ^ i [(w — Xf{n — 4" (-“On 4- 16) “!)(« — 

w 

4- (/ - 2w 4- 10) in - 1) in - 2)^^ + (9n' - 36w 4- 60) (n - 1) (» - 2) A] 

etc, 


In the same way 


V/ 1 I “ 272-4" 3)a<2 


^2) — ^ 4" " 


nin “ 1) 

fi6 , “ 5a 4- 10)ju3P2 


72- 


n(n — 1) 


V; \ I (“Oa 4' I 4" l0)/i3 (9n^ — 3 6 a 4~ 60} ^a 

ti 2 W =-4-'4,(.^^iy- + 72(71-1) 7i{n - l)(n “ 2) 


71(71 — 1 ) 
etc. 
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^2(7712) ^ ^ 
n 

Ull(wia, 7 W 2 ) = — “I" l)/^3/^3!l 

n 

^ "1“ (^ “ l)w] 

n 

etc. 


12 . The Expected Value of the Products of f Functious in Terms of the 
Thiele Moments of the Universe. The formulae giving the ju’s in term if the 
N's are 

P2 =f ^2 

f 

fig = ^3 

7^4 *= ^4 "h 3 Xj 
pj == Xj -|r lOXjXj 

pb = Xf -j- 16X4X2 4 * lOXa "h 46 x 1 



where the summation holds for those partitions having no unit parts. See 
the results of Craig (2, 7 - 11 ) and Frisch (12, 21 ). It is at once possible to 
express the moment formulae in tenns of the Thiele moments of the universe. 
Thus the general results above become 

^ tjwXi 4 “ [ 36*71 4 “ (62 4 " 2611)71(71 — l)]X2 

Mii(/ 4 j fi ) = 636271X5 4 “ [ 10 bj 6 j 7 i 4* (6462 4 ~ 362162 4 " 66 ji 6 ii)?i( 7 i “ 1)]X3X2 

m!(/s) = 6}riXB + [ 16 blw + ( 96^1 + 663621)71(71 - I)]XiX2 

4 “ [lObjTi 4 " (63 4 “' 9621)71(71 ^ l)]Xa 

4 “ [ISb*?! 4" ( 27 bji 4 - 1863621)71(71 — 1) 4 ^ (6621 4~ 66111)71(71 — ' l)( 7 i — 2)1X2. 


13 . The Thiele Moments of the fs in terms of Thiele Moments. It is 
now possible to reduce to the Thiele moments of the /'s by means of the usual 
relations ' 

X2(/.) - M2(fr) - /iU) 

XllC/ru/rj) ^ , frt) " Ml{l(/rw /ri)/4(n(/rj , /rj) 

' X3(/r) == ;i8(/r) " 3M2(j'r)Ml(/r) 4^ 2jwl®{/r) 


etc. 
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so that the results become 

M(fi) — 'b bi\n(u — 1 )]X 3 

Xii(/3, /») = 63h2?l^6 + {3[6ib2n + hubanin — 1)] -1- Slfcahgn + hnbnn{n — 

— hawXe + {Olh^Ti. + hQhiin{n — 1)] -f- 9[b37j -)- }}\in{n — 1)]}A4X2 

H- 9[6aV 4- bUin - 1)JXS + {9[5aV -f 2b,hin(n ^ 1) + hU{n -- 1) + bW^^] 
4" Q[hln “b 3b5i?i(7i — 1) -f h\jxn{n — l)(n. — 2)]}X3 

etc. 

The formulae as written are adapted to the partition representation of Part III, 
When the are equal to the m’s we have 


XaCwia) = 


_ {n - \)\i . 2{n - 1)X 




+ 




XiiCwa m) = -"■2)X6 , 6(n - 1) (« - 2)X3X3 


Xa(ma) = ^ ~ 2)^Xfl , 9(n - l)(w - 2 )^X<X 2 

~ -oB ' 




, 9(« - 1) (n — 2)*Xa . 6(7^ — 1) (n — 2)Xz 

T ZTa \ 


IT 


n‘ 


etc. 


which are the results aa previously given by C. C. Craig (2, 55) . In like manner 
when the /r = h 

n — 1 


fa) . 


fa(fa) = ^ + — 

n n ^ 1 (n — 


6n Xj 


(n — 1) (n - 2) 


etc. 


as given by R. A. Fisher [3, 210] while 

X 3 (?W 3 ) = -(X4 -h 2X2) 


Xii(^i^a> Wj) ~ ^(Xs 4“ QXsXO 
n 


Xi('ttifl) — - (Xj 4" ISXtXi 4" 4“ ISXi) • 

n 


etc. 
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14. Various Formulization of Results. Altliough different moment functions 
of the nnWerse may be used it is customary to express the results in terms of 
universe momenta about a fixed point, in terms of universe moinentSj or in 
terms of universe Thiele moments. It is possible to express results in any of 
the nine forms 


f moments about a fixed point (ti') 
fi(/,) > in terms of i moments (ft) 


HJr)] 


Thiele moments (X) 


where fr represents the isobaric sample moment function of weight r. One 
pm’pose of sucii varied formiiliaation is to discover the most compact form 
and also the one best adapted to use in the case of a normal universe or a uni- 
verse whose moments obey some discoverable law. As suggested above Craig 
(2) has .shown the relative compactness obtained by using \{mr) and Thiele 
moments of the universe while R. A. Fisher (3) has shown the great additional 
compactness obtained by taking = fer< 


15. The Application of the Algebraic Method to XaiC/si /?). Before leaving 
the algebraic method it is perhaps wise to outline the steps in the case of a 
more involved problem. We take the example which R. A. Fisher (3, 207) 
has used in the case in which /, = kr. To find Xjif/a, /j). 

The value of /a/a Was found in section 8. To find its expected value it is 
only necessary to enter the coefficients of the different partition products in 
this expansion at the left of the corresponding rows as indicated in Table II. 

The coefficient of any moment partition of the universe is found by multi- 
plying each column entry by its corresponding left row entry and then by 
multiplying by as indicated at the top. Thus the coefficient of jai is 


*b a^Qii d- "b -b ~b fiflziaa d' -b 


~b + dujjaiiiaii -b + Uiiiau)?! 


which after some algebraic work reduces to 


(^3 + Susi + aui)*(a2 -b niOn- = hllhn. 

In this manner it is possible to write the result either in terms of universe 
moments about a fixed point or in terms of universe moments, If moments 
are used, one may neglect all column partitions involving unity. 

It should be noted that the a's defining kr as given in Table I can be inserted 
here if desired. If these multipliers are introduced throughout the rows and 
columnar partitions involving unit parts are not used one will arrive at Table I 
of R, A. Fisher (3, 208] though there are some slight typographical errors in 
his rows for (3)“ (1),’” and (3) (2*) (1). 

Determining all the coefficients in this manner we find after considerable 
algebraic manip\ilation that 
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[&a^2 4" + 12&36ai&ii 6&362ibi]'n(Ti — 

4" |2b3b2 4" 15b2ib2 4" ISblibji 4" 6b3b2ibs 4“ 12b3b2ibji]7i(ft — 

4" [Sb^bji + Ob^ibj 4“ IBbLbu 4" 6b3b2ib2]7i(rt — 1)^5 -I- [SOb^bz 

' 4" 54b2ibii 4" 6b3!j2ib2 4“ 12536zibii 4" 1263b]iibi.] 4" 72b2ibijibii 

4" I86i]ib2]w(K — l)(?i — 2)//4/i2 4~ [babfl 4~ 6b3b2ibs 4^ ISbabjibn 

4" 27b2ib2 4“ 90b2ibii 4* SGbzibu^bz 4" 72biibiLibu 4- 36biubtil?i(^ “ 00^ ^ 2)fi3fi2 

4" [^bzibz 4~ ISbzibn 4" SGbzibnibn 4" Sbuibz 4“ 36biiibii)7}(7i — l)(w — 2)(tt — 3)ju2. 

If/r = hr ti)G proper values of b arc inserted and the expression above becQjTies 
that given by E. A. T'isher (3’j 208). For example the eoefficient of nt is 

(9n - 63w^ + 240n - 420) (n -- 3) 
ri^in - l)Hn - 2) 

when 


b2 = 


1 



1 

n(n — 1) 


bji = — 


1 


n(n — 1)’ 


bill ^ 


n(n — 1) (?r — 2)' 


The algebraic results involved in changing tlie general formula above to 
other function.? are too extended to present here. A symbolic means of attaining 
them i.s included in later sections of the paper. 


Part II. The Determination, of Specific / Functions 

16. Functions Determined by the b’s. In Part I it was shown how various/ 
functions are defined by giving definite values to the coefficients of the power 
sums. It is the purpose of this part of the paper to show how functions can 
be specified by means of their expected values in term.? of moments of the. 
universe. This is essentially the method used by R. A. Fisher in defining his 
h function and it is here extended to other functions. In this case the b's are 
first determined and the a’s arc then found from them. The first moments 
of /ij fi) fs wero given in section 10. To these we add, as shown by Table II 

tii(fi) — 4" 4^31 4" 3ai2 4" fioan 4" ffinOn.^^ 4' 4(a3i + Sflzn 4" 

d’ 3(^22 4" 2 o2ii 4“ aiiii)'n'('a 1 )ms 4* 0(a2ii 4" ~ ~ 

4* ~ 1)(^ ~ 2)(?i — 3)/ii* 


etc. • 
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These can. be written more symbolically in terms of the 

= hinfx\ 

+ hMn — 

M!{/a) = hntii H- 36sin(« - + 6iuw(7i. - l)(n - 2)ni 

liiifi) *= + 4b3in(n — + 3622n(7i — -}- Gbaiiti^^VaMi^ 4- 6?r^^Vi*r 


and in general 


r 


\Vi ' Pi 


Tt 


Pe'/ 


bp*i 


p:*n 


(p) 




The expansion of the function in terms of the power sums of the sample demands 
the determination of the o's. This can be accomplished b3' solving the equations 


a\ == bjt 
di 4" till = h% 




bn 


ttj 4" Sail 4* am — ha 

Oai 4" am = hai 
Rill = hm 


Qi 4- 4a3L 4“ Som 4“ 4" Oiiu = ^4 

I 

nai 4“ Soun 4- aim “ hsi 

(hz 4" 20211 4~ aim = biz 


The solutions are 

Oi =■ 6i 


etc. 



On *= bji 

Ra ^ hj — 3621 4* 2bni 

Oai ^ 631 — 6m 


dill = 6m 

04 = 64 — 46 gi — 3654 "b 126211 — 661111 
® 3 i = 6gi 36211 "b 261111 

P 22 ~ 6 j 3 . 26211 4" him 

oaii = 6211 — 6uiv 
ami = him. 
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The values of , at least for r ^ 4, follow the law 


a, 




(-ir* (p -1) I b,T 



and 


(hi = where \(h.ai^ indicates that — 62 “ bu is multiplied by ai = 61 , 
the rule of m.uItLpliGa tLQn being suffbcing of subscripts. Similarly Om - loS = 
— ba) (bz — bn)* ~ ^22 — 2 baii -b bim- 
This statement illustrates a general theorem which will be established later 
in another paper by a different approach that for all cases 


and that 



{-irHp - Dib.fi 


r 

Vf 


I 


€if • ■ ' p 

t u 




This theorem enables one to write, with comparative ease, the coefficient of 
any product of power sums in a sample function whose expected values is defined. 
For example the functional coefficient of (3) (2) in/g is 

la^aj — l(ba — Sbai + 26iu) (b* — bu)* = — bgn — 3b22i "b 5b2iu — 2!?iiiii 

while that of (3)(1)(1) i.s lagniail = bju - 3b2ui + 2biini. If the expected value 
of the function is known the b’s are determined and the values of the above 
expressions can be found by substitution. 


17. The Values of the Fisher Moments {k functions). The fc functions have 
been defined to be the.se functions whose expected values are the Thiele moments 
of the universe. Thus fnih) = Xr and since 






P^*J 




it follows at once that by comparison with in the last section, that 


b«f 1 


3 • 1 1 « 




(p- l)i 


Thus 


hi = -J b2 = 
n n 


bn= - 


TO 




1. 1 I, -1 >. 2 




-1 


-1 , 2 

^ 22 =*^; 0211 = ;^; 


J. * 
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The insertion of these values in the formulae of section 16 gives the values of 
fl such M those indicated in Table I and in section 5. Thus the coefiicicnt of 
(3) (2) in U is 

10(6s 2 — bau ” Sbsai + 5b2iii — 2biiiii) = “ 10 -j- ^ ^ 

" {n- l)f')‘ 


The coefficient of (3)(1)(1) is 

[ 2 18 48 "1 

^ ^ J 


10(271 + 4) ■ 
{n — 1)<^^ ' 


18. The k Functions. It is also possible to define a function whose expected 
value is the moment of the univei'se. Thus Mi(hr) — Hr where 


and 


fir 




I 


1 if s = 1, TTi = 1, and pi = r. 

(—1)'* if Pi > 1, TTi = 1, s = 2 and pa = 1. 
(-1)'^"* (r " 1) if Pi = 1, s = 1, and n = r, 
1 0 in all other cases. ' 


Comparing with the value of tii(ff) in section 16 we have 




il* - 


J»r 




n 


(fi) 


The substitution of these values of b in the results of section 16 gives the expan- 
sions of hr in terms of power sums ns illustrated by the formulae of section 6 
and Table I. Thus the coefficient of (3) (2) is 

10(&32 — bjii — 36221 5&2111 — 2 biiiii) 


n»> ^ nW + 

Similarly the coefficient of (3)(1)(1) in Ag is 


-10(?i - 2) 

{n - l)t'J ■ 


10(6311 - Sbjui + 2611111) = 10 [4) + “4 + 4)1 = - 4 ?i. -h 8) 

19 , The h Functions. One line of attack calls for the introduction of new 
moment functions which will result in simpler formulae. Thus for example, 
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C. C. Craig 'wrote (2, 37) ^‘It rather seems that the best hopes of effectively 
further simplifying the problem of sampling for statistical characteristics lie 
either in the discovery of a ne-sv kind of symmetric function of all the observa- 
tions which may be used to characterize frequency functions and which will 
be more amenable than either moments or semi-invariants for use in sampling 
problems, or in, what may very well prove to be much better and more 
feasible, the abandonment of the method of characterizing frequency functions 
by symmetric functions of all the observations altogether/^ 

R. A. Msher has shown that it is possible to introduce symmetric functions 
which do simplify the resulting formula appreciably. It is the purpose of this 
section to introduce an additional symmetric function which simplifies the 
resulting formulae to a much greater extent. It is admitted that this function 
does not have all the properties (such aa invariance with respect to change of 
origin) possessed by the Thiele and Tisher functions, but it does not have the’ 
property of making the resulting formulae simple. It also has the advantage 
that mCAJ) = /(/*;)• 

The basic idea is to find a sample moment function whose expected value is 0. 
A first attempt, placing every h = 0, is of no avail since every a is also equal 
to 0 and there is no function. A second attempt is based on the idea of finding 
the function k whose expected value is m . If the universe is assumed to be 
measured about its mean, as is conventional, it follows at once that jui = 0 
and /iiCAr) = 0 so that 

llpy(Jlfy I /trj) ” I ^n)' 

This function then has the property that its moments about a fixed point and 
its moments are identical. 

In order to discover its expansion in terms of power sums, we note 

fiiihi) = fii 

and it follows at once by comparison with /ii(/r) in section 16 that &jr - ^ 

and = 0 in all other cases. The a’s are determined in the usual 

way. Thus 

0, = b, - bu = - 

so that 
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Similarly 

hz = 4i [2<3) - 3(2)(l) + (1)*] 

7j,w) . 

I 

ft! = - 4 [6(4) - 8(3)(1) - 3(2) (2) + 6(2)(1)(1) - (1)'] 

and in general 

hi = (-!)'■' t(p. - 1) ir' t(p» - 1) ii" • • ■ [(P. - 1) 11'' 

' ’ • • P- 7 ) 

In order to show the simple form in which results can be given we substitute 
the values of the b'B in the results obtained above. Not only does ^i(hr) — 0, 
but by section U 



Mi 


= MiChi) = ^2(^2) = 

Xlllhg , hi) = ^u(X5 , ^2) — JUllC^S , X2) = 0 


X 4 (Aa) = Miihz) — MiO^a) =» 


6 


n{n — 1) (n — 2) 


3 

Mi 


while from section 15 


X 3 j(A 3 , ^ 2 ) = Miiihs/hs) = Msiihsthi) = 


36 MiMi 




36(71 — 3) ^i2 


n\n - l)®(w - 2) n\n — 1 ) 2(71 — 2)’ 


I 

It is to be noticed that these formulae contain very few terms and that the 
terms themselves involve very low moments of the universe. This simplicity 
has been attained without making any assumption such os normality, regarding 
the nature of the universe. 


20. Table of Values of h for Different Functions When r < 6. This process 
of defining functions by means of expected values could be extended indefinitely. 
Perhaps it has been applied to enough functions to suggest the breadth of the 
applicability of the theory developed in Part I and Part III. 

As the &’s are the quantities which are used in the formulae I have provided 
Table III giving their values for the six functions, ml, h, K, K when 

r - 1, 2, 3, 4, 6. When the a's are known, the b's are computed from them 
according to the formulae of section 16. 
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' TABLE III 
TaitiGs oj the h’s for r ^ 5 


NumJ 

OOGf. 

■ 

1 

tJJr 

ir 

K 


f 

h 

r 

1 

6i 

H 

1_ 

1 

1 

1 

1 

n 

n 

n 

n 

n 

n 

1 

h 

1 

n 

n — 1 


1 

n 

1 

rt 

Q 

1 

in 

0 

1 ~ 

1 

1 

1 

1 

II 


7^® 

li(s) 

nti) 

n<« 

1 

h 

' 1 

(rt — 1) (n - 2) 

(71 — 1) (rt ~ 2) 

1. 

1 

0 

n 


rt® 

n 

rt 

3 

m 

0 

(n-2) 

rt — 2 

1 

u<J) 

1 

“ rt<>) 

0 

i 

t 

0 

2 

2 ‘ 

2 

2 

1 

1 

bill 



' ft<»> 



1 

6* 

1 

(n - 1) (n> - 3n + 3) 


1 

1 

0 

n. 


n* 

71 

n 

4 

B 


-Zn + Z) 

71* 

(n* — 6rt 4* 0) 
n* 

■ jiW 

1 

1 

rt«> 

0 

3 

^39 . 

0 

271 — 3 
n* 

(n* ^ 4n “f 6) 
n* 


0 

1 0 

6 

6]11 

0 

rt ' 3 

n* 

2(rt - 3) 

rt* 1 

2 

tv«) 

J_ 

0 

1 

6)111 


_ i 

6 

B 

8 

V 

n* 

i n* 



rt<®> 

1 

6b 

1 

(rt-l)(tt-2)(n*-2rt + 2) 

(rt-l)(n-2) (rt»-12rt+12) 

1 

1 

0 


n® 

rt® 

n 

n 


bii 

I 

(rt® — 4rt* + On ~ 4) 

(rt® - 14rt“ + 36rt - 24) 

•1 

- J, 

0 

n® 

rt® 

ftO) 




0 

— 471 + 4 

(rt® - 8rt» + 24rt - 24) 

iBWBM 

0 

n 

71® 


l|^g 

u 

10 

6»ll 

0 

77“ — 371 -|- 4 

71® 

2rt* -1871+24 

71® 

rt^*^ 

_1^ 

rt^®> 

0 

16 

6sn 

1 

0 

_ 2(71 - 2) 
n® 

2rt* - 12n + 24 

71® 


0 

0 

1 


,6im 

0 

» - 4 

rt® 

6 (rt - 4) 

rt' 

- A 

7J,W) 


0 

1 

6]jii[ 

I 

_ _ __ 

4 

rt® 

rt® 

ftW) 

B 

-L 

ft(i) 

_ _ 
















































































































































44 


PAUL S- DWYER 


Part III. Combinatory Methods 


21 , Partition Representation of Expected Value of / Functions. The formu lae 

lti[(/3) = &371^3 + 3b2in(n — 1 )m2Mi “ i)(^ ~ 2 )mi^ 

fiiifi) — hnii'i + 46ci«'(»i ~ + 3biin(ft — 1)^2^ 

4 ' 6bi\ifi(n — l)in — 2 }fi 2 fit^ 4 - 
are ‘'synthetically” given by the column partitions 


1 

2 1 

1 

3 2 

1 


4 3 

1 


1 
1 
1 

2 2 

2 1 

1 


1 

1 


The partition parts represent both tlic subscripts of (he moments and the 
subscripts of the b’s. If p indicates the number of parts, the n multiplier 
is The numerical coefficient is obtained by taking the sum of the entries 
in the column (the weight) and dividing it by the factorials of all entries times 
the factorials of all repeated entries jus imlicatecl by 


r 





r! 


(pi (p2l)’^® •/ • (p* 0^' TTlI ir2 1 ■ ■ • ITe I 


The translation from the synthetic partition form to the expanded f(n'm is 
accelerated if the coefficients am known, These are provided in the following 
partition representatioj) of the formula for p!(/r) when r ^ 8 and- the results 
are expressed in terms of the moments of the universe 

0 

mI(/2): 1 

2 

1 

3 
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1 3 

4 2 
2 

Mi(/ 6 ) ■ 1 10 

5 3 
2 

mIC/a); 1 16 10 16 

6 4 3 2 

2 3 2 

2 

1 21 36 106 

7 5 4 3 

2 3 2 

2 

1 28 56 36 210 280 106 

8 6 5 4 4 3 2 

2 T 4 2 3 2 

2 2 2 
2 

The proper formula can be stated immediately from its synthetic representa- 
tion. Thus for example 

+ 156«n(n — 1)^442 + — l)j^a 

d- 1562j2n(«. - !)(«■ — 2)/ia* 

22. Partition Representation of the Expected Value of a Product of f Func- 
tions. Two column partitions may be used similarly to represent the expected 
values of the products of two /’s, three column partitions for the expected value 
of the triple product, etc. In order to obtain all terms it is only necessary to 
combine every partition of each / in every po.ssible way. The .synthetic repre- 
sentation of ISinii, nil) is 

112 1 

21 20 II 10 

01 10 10 

01 

The sum of the entries in each row indicates the proper moment while the 
number of rows indicates the number of parts as in the preceding section. 
The n coefficient associated with a p rowed partition is then . The b coeffi- 
cient is indicated by the columnar entries. Thus 

Pu{f 2 ji ~ babiWAtSd- [&261 + 2bnhi]n{n - -}- hnhinin - l)(tt -. 2 )^?. 
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4& 

We verify this by the algebraic method 

Mh,h) - M2) + 

= i?hai(2)(l) + 

= <hai[nii3 + n{n — 

+ "b 3tt(w — ~h nin — l)(w — 2)/ii^] 

= (fla + + (flj + 1 )m2Mi 

+ 2auaiM?/Ji + anaMn - l)(n — 2)jul“ 

= hihififis + ~ + 2bnpi'>^{n ~ 

+ hnhnin - l)(n - 2)^!® 

as indicated. 

It thus appears that the partition representation is a mnemonic device for 
indicating the sointion as obtained by the algebraic method. A move formal 
justification is based upon the property that if 

EiS^) - 6,(2) + 6u(l)(l) and EiJ,) - 6v(l) 

then Eift.,f]) can be obtained by a symbolic multiplication of ha(2) + 6ii(l)(l) 
by 6i(l) where the 6’s are multiplied but the power sums are collected in all 
possible ways. Thus 

EUj./i) - w.l(3) + (2)(1)1 + bnl>.[2<2)(l) (l)’l 

which gives 

* 

- 6abiW;tS + bjbin(n - \)ii2n\ + 2hiihin{n — 1)m2p! -h biibin^^V!'* 

os before. 

This symbolic multiplication is generally true and serves as the real algebraic 
justification of the partition representation. It will be established in a later 
paper dealing with the more general case of a finite population. The general 
type of partition analysis has been used previously by Fisher (3) and Georgescu 
(4), Each has established it through analytic rather than algebraic means. 

23 . Determination of the CoeflScients, Methods of determining the numerical 
coefficient have previously been given by such authors as Fisher (3), Wishart (5) 
(7) and Georgescu (4). If the/'s are of different weight, the coefficients of any 
partition (an interchange of rows is not looked upon as changing the partition) 
is given by writing in the numerator the factorials of the different r’s and in 
the denominator the factorials of all the different entries and the factorials of 
all repeated rows. Thus the coefficient of 


4!3Uf 
2 1 (11)^21 


= 72 . 


210 

111 is 
111 
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In case two or more functioris have the same weight additional equivalent 
partitions are formed by interchange of columns. The reader is referred to the 
above papers for rules for determining the coefficients in the more involved 
cases though the coefficients are presented for ail the two way partitions of the 
next section. 

An alternative method of finding the coefficients is that given by C, C. 
Craig (2, 24-25) since it appears that the symbolic formulae used in the present 
paper are essentially his formulae for v’s in terms of W For example his for- 
mula for ( 2 , 22 ) is given symbolically by the formula for 44 in the next 
section. The only difference revealed is that the subscripts- of the X’s are read 
by rows rather than by columns and that they are sometimes interchanged. 
The more precise formulization is needed for the present interpretation although 
it is not needed* for Prof. Craig's purpose, 

A third method utilizes the symbolic multiplication process stated in seC' 
tioii 22 . Subscripts of the b's are used to indicate which power sums are col- 
lected. Thus [ 1 ) 2 ( 2 ) d- bji(l)(l)]^ gives 

62b2(4) d~ b2obD2(2)(2) -f- 2[2b2C(bii(3)(l) -f b2tnbflii(2)(l)(l)] -h 2biibii(2) (2) 

4binybiM(2)(l)(l) -b biiDoboom(l)(l)(l)(p 

where the underscored terms indicate the products given by [ 62 ( 2 )]*, 2 [b 2 ( 2 )] 
Ibii(l)(l)], and [bn(l)(l)f respectively. This is represented by 


1 

1 

4 

2 

2 

4 

1 

22 

20 

21 

20 

11 

11 

10 


02 

or 

01 

11 

10 

10 




01 


01 

01 







01 


The underscored terms are the only ones remaining when fii = 0 . 

This method is especially useful when a large number of formulae are to be 
computed, as in the next section, 

24. The Partition Representation of Formulae of Total Weight ^ 8 . Tlie 
partition representation of when r ^ 8 are given in section 21 . The 
partition representation of the remaining formulae of total weight ^ 8 , which 


do not contain 

unit parts, 

are given below 

22 1 

1 

2 


22 

20 

11 



02 

11 


32 1 

1 

3 

6 

32 

30 

12 

21 


02 

20 

11 
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1 

1 

8 

6 

4 

6 

3 

12 







42 

40 

31 

22 

30 

21 

20 

20 








02 

11 

20 

12 

21 

20 

11 













02 

11 






33 

1 

6 

9 

1 

9 

9 

6 








33 

31 

22 

30 

21 

20 

11 









02 

11 

03 

12 

11 

11 













02 

11 







222 

1 

3 

12 

6 

4 

1 

6 

8 







222 

220 

211 

201 

111 

200 

200 

no 








002 

on 

021 

in 

020 

Oil 

oil 












002 

on 

101 






62 

1 

1 

10 

10 

6 

10 

20 

10 

20 

15 

60 




52 

60 

41 

32 

40 

22 

31 

30 

30 

12 

21 





02 

11 

20 

12 

30 

21 

20 

11 

20 

20 











02 

11 

20 

11 



43 

1 

3 

12 

6 

1 

4 

12 

18 

12 

3 

IB 

36 

36 


■43 

41 

32 

23 

40 

13 

31 

22 

30 

03 

21 

12 

21 



02 

11 

20 

03 

30 

12 

21 

11 

20 

20 

20 

11 










02 

20 

02 

11 

11 

322 

1 

2 

4 

12 

3 

1 

4 

6 

12 

12 





322 

320 

311 

221 

122 

022 

301 

220 

121 

211 






002 

on 

101 

200 

300 

021 

102 

201 

111 





1 

2 

6 

12 

12 

12 

24 

12 

24 






300 

300 

102 

021 

201 

in 

210 

120 

111 






020 

on 

020 

101 

020 

on 

101 

101 

101 






002 

on 

200 

200 

101 

200 

on 

101 

110 





62 

1 

1 

12 

16 

6 

30 

20 

16 

20 






62 

60 

51 

42 

50 

41 

32 

40 

31 







02 

11 

20 

12 

21 

30 

22 

31 






16 

30 

120 

46 

10 

60 

120 

90 


16 

90 




40 

40 

31 

22 

30 

30 

30 

21 


20 

20 




20 

11 

20 

20 

30 

12 

21 

21 


20 

20 




02 

11 

11 

20 

02 

20 

11 

20 


20 

11 




02 11 
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53 


44 


422 


1 

3 

16 

10 

1 

16 

30 

10 

6 

30 



53 

51 

42 

33 

50 

41 

32 

23 

40 

31 



' 

02 

11 

20 

03 

12 

21’ 

30 

13 

22 



16 

60 

90 

16 

30 

10 

30 

60 

90 

90 

46 

60 

40 

31 

22 

13 

31 

30 

30 

30 

12 

21 

20 

20 

11 

.11 

20 

20 

20 

03 

21 

12 

21 

21 

20 

11 

02 

11 

11 

20 

02 

20 

02 

11 

20 

11 

n 

11 











02 

11 

1 

12 

16 

8 

48 

1 

16 

18 





44, 

42 

33 

41 

32 

40 

31 

22 






02 

11 

03 

12 

04 

13 

22 





6 

96 

36 

72 

48 

16 

72 

144 

9 

72 

24 


40 

31 

22 

22 

30 

30 

21 

21 

20 

20 

11 


02 

11 

20 

11 

12 

03 

21 

12 

20 

11 

11 


02 

02 

02 

11 

02 

11 

02 

11 

02 

11 

11 










02 

02 

11 


1 

2 

4 

16 

6 

4 

8 

4 

24 

16 



422 

420 

411 

321 

222 

401 

320 

122 

212 

311 




002 

on 

101 

200 

021 

102 

300 

210 

111 



1 

16 

6 

12 









400 

310 

220 

211 









022 

112 

202 

211 









1 

2 

16 

32 

12 

3 

24 

24 

48 

48 



400 

400 

310 

310 

202 

022 

211 

220 

211 

121 



020 

on 

no 

101 

200 

200 

200 

101 

101 

200 



002 

on 

002 

on 

020 

200 

on 

101 

no 

101 



8 

16 

12 

24 

12 

16 

48 

96 

24 

24 



300 

300 

210 

021 

120 

300' 

201 

210 

111 

210 



120 

021 

210 

201 

102 

111 

120 

111 

111 

201 



002 

101 

002 

200 

200 

on 

101 

101 

200 

on 



3 

M 

6 

48 

24 








200 

200 

200 

200 

no 








200 

, no 

200 

no 

no 





■ 



020 

no 

on 

101 

101 








002 

002 

on 

on 

101 
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332 

1 

1 

9 

12 

6 

2 

18 

18 

6 

12 



332 

330 

232 

321 

312 

303 

212 

221 

320 

311 




002 

no 

Oil 

020 

030 

120 

111 

012 

021 



2 

9 

18 

6 



1 






301 

220 

211 

310 









031 

112 

121 

022 









9 

18 

•6 

12 

12 

18 

9 

72 

18 

36 



220 

220 

310 

301 

310 

202 

112 

211 

112 

211 



no 

101 

020 

020 

on 

no 

200 

no 

no 

101 



002 

on 

002 

on 

on 

020 

020 

on 

no 

020 



1 

6 

12 

9 

18 

36 

36 

18 

36 

72 

36 


300 

300 

300 

210 

210 

210 

201 

201 

310 

210 

111 


030 

012 

021 

120 

102 

012 

111 

021 

101 

111 

111 


002 

020 

on 

002 

020 

no 

020 

110 

021 

on 

no 


9 

18 

36 

6 

36 








200 

200 

200 

110 

no 

1 







110 

101 

no 

110 

no 








020 

on 

on 

no 

101 








002 

020 

on 

002 

on 







2222 

J 

1 

4 

24 

24 

32 

3 

24 

8 





2222 

2220 

2211 

2201 

2111 

2200 

2011 

nil 






0002 

0011 

0021 

0111 

0022 

0211 

nil 





6 

12 

48 

96 

48 








2200 

2200 

2011 

2011 

nil 



. 





0020 

0011 

0011 

0101 

1100 








0002 

0011 

0200 

Olio 

0011 








24 

48 

96 

16 

48 


32 






2001 

2010 

2100 

0111 

1011 

1011 

0111 






0201 

0201 

oni 

0111 

1110 

0111 

1101 






0020 

0011 

0011 

2000 

0101 

1100 

1010 






1 

12 

32 

12 

48 








2000 

2000 

2000 

1100 

1100 








0200 

0200 

0101 

1100 

0110 








0030 

0011 

Olio 

oon 

oon 








0002 

0011 

0011 

oon 

1001 
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25. The FoTtuulae for the Sample Momeiits about a Fixed Point in Terms 
of the Moments of the Universe, The partitions of section 21 and section 24 
can be immediately interpreted to give the formulae for the moments of the 
sample function. For example 

■{" (l> 3&2 362162 d' 662i6ii)n(n. ~ I)n 3 iu 2 

and the value of U 2 i(/a , /a) as given in section 15 can be read by inspection. 
The value of the 6 's are to be inserted for any specific function. The coeffi- 
cient of III in the expansion of M 3 (/ 2 ) is 

(62 + 662611 86ii)n(?i ~ l)( 7 i ^ 2). 


In case/2 “ wij, 62 = 


n. “ 1 


n 


s » 


and 611 


so that the coefficient is 


{n 2 ) {fi — Sn’' + “15) 


as indicated previously by Tchouproff ( 10 , 192) and Churcli (9, 82). 

The partitions of section 21 give the 8 fonmilae /Ur . w which Tchouproff 
gave (10, 155). In this case/r = ml and every 6 is 0 except those having single 

subscripts and tliese equal 

The partitions of section 21 give the formulae Pt.w which were given by 
Tchouproff ( 10 , 186). In this case it is only necessary to take fr = nir and to 
give the b’s the proper values. Tchouproff has arranged his results according 
to decreasing powers of n. As nn illustration we derive his result for vt . (») - 
^ 1 (^ 4 ). From section 21 

^ bitifii + 3ba2n(?i — 1 )m2 


and from Table II 


so that 



64 


{n — 1 ) (n^ —3n 3) 

n* 


and 622 


2?i - 3 
n* 



= JU4 + - “ 4 ^ 4 ) — i (15^2 - 6 ^ 4 ) + A — 3/i4) 

n n n.’' 


os indicated by him. 

The partitions of section 24 also give formulae which have appeared before. 
For example the partitions 

1 1 
22 20 

02 
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whioli syjnbolizfi the forixiula 

Ms(/2) “ hlnm + (&s “h 26ii)?i('n *- l)Ma 

become 

~ ^ '• 3 " Kii' — “h ~ 2n '4' 3)fii] 

H/i 

which was early derived by "Student” (8, 3) and Tchouproff (10, 192). Simi' 
larly the partitions of 222 and 2222 give the formula for and niinh) and 
which were given by Tchouproff (10, 192^193) and Church (9, 82), 

Sections 21 and 24 can then be used to write the moments about a fixed 
point of a sample function in terms of the moments of the universe, In the 
case of new functions the b’s must first be determined. Formulae involving 
unit columnar partitions are not included. If the formulae were desired in 
terms of moments about a fixed point of the universe, it would be necessary 
to write in addition all possible partitions. See for example the last formula 
of section 23. 

26. The Formulae For Moments of Any Sample Functioii in Terms of Mo- 
ments of the Universe. The partitions of aectior^ 21 and 24 are also useful in 
writing the formulae for the moments of the sample moments. It is necessary 
to make the usual adjustments in changing from moments about a fixed point 
to moments: 

M) - «(/,) - 1‘iVr) 

/^nC/ri,/ra) #*lo{/ri , /r,)M0l(/.j ) /r,)* 

The particular two way partitions which are involved in this adjustment are 
immediately recognizable. They are the ones which have an entry which is 
the only entry in the row and in the column in which it is. Thus 3 gives 

220 

002 

one of the terms contributing to ju^Cfa) In addition its coefiioient is the 

sainej if sign is not considered, as the coefficient of ms(/!!) in the expansion 
of hdh) in terms of moments of fi . This has to be true since each is the number 
of ways of forming 220. And so in general the remaining function of n aecom- 

002 

panying this adjustment is the product of the coefficient associated with 22 
and that associated with 2, The sign is plus when odd numbers of momenta 
are multiplied and minus when even numbers of moments are multiplied. 
Hence 3 contributes — Sn’^&a to the adjustment to moments and the total 
220 
002 

contribution of 3 to the value of /is(/s) is Zhl[nin - 1) - n^] ^ -36sW. More 
220 
002 
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extensive study leads to the following general method of using tlie formulae of 
section 24. 

A. Write the coefficient of every two way partition according to section 25. 

B. Block off each single entry by drawing a line through its row and column. 
For example 

6 

, 

092(1 

Xj^jXTZt 

The resulting partitions, 22, 2, 2 are called component parts, 

C. Form new partitions by eliminating component parts one at a time, two 
at a time, three at a time, etc. from the original partition in all possible ways. 

D. Form the coefficient of the resulting parts according to the metliods of 
section 25. Multiply by (—1)*”’ where s is the number of resulting parts. 
Tho values of b will not change. 

E. Multiply in addition by s — 1 when the component parts are all taken 
separately, ^ 

6 

As an example we find the contribution of the partition 2200 to the value 

0020 

0002 

of fiiifi). It gives 

Qbl[n(n ^ l)(w — 2 ) — Sn^{n — 1 ) + 2n\iH2fi2 — I2nb2innl. 

Similarly 1 contribn 
2000 
0200 
0020 
0002 

— 47in*^’ + Qn^{n — 1) — Zn*]fil = 362(?i — 

We use the method in finding the coefficient of in the expansion of 
We find first the coefficient of juj in the expansion of (izifi). It is indicated by 
the partitions 


1 

6 

8 

200 

200 

no 

020 

on 

oil 

002 

on 

101 


so that the^ coefficient of fA is 

hl[n(n — l)(7i — 2) “ Sn{n — 1) + 2n^] + 6b2bn[n(n, — 1)(« — 2) — n^{n — 1)] 
+ 8buw(n. ~ 1)(» — 2) = 63(2n) + Obabjif— 2rA + 2n) 

4 Sbiv?t(n — l)in — 2). 
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71 - j i 2(n ^ l)(n® - I2n + 15) 

When h « find fcjii == ^ this becomes 

previously given by such authors ajs Tehouproff (10, 194), Church (9, 82), 
Carves (Riehasdsoa) (U, 271). 

The general Tchouproff-Church formulae for the third and fourth momenta 
of the variance may he written out in this way as may many other moment 
formulae which have not been printed. 

27. The Thiele Moments of the Sample Function in Terms of the Moments 
of the Umyerse. It is possible also to write the Thiele moments of the sample 
function in terms of the moments of the universe. The technique is very 
similar to that of the pievioua section. The basis of the traneforma-tion is 
now the formula for Thiele moments in terms of moments about a fixed point 
rather than moments in' terms of moments about a fixed point. The results 
are the same as those of the last section when a double or a triple product of 
fs is involved, but they differ with the introduction of a larger number of 
products. The paiUtions having component parts ate broken up into these 
component parts as before but the parts are combined in all possible ways, 
MultvpUera ate determined as before with the exception that there is a multi- 
plication by (— l)*"’^(fi ^ l)i where s is the number of resultant parts. Thus the 
2000 

term 0200 contributes 1)'^ -h 12n^(Ti — 1) 6n‘‘]M« = 

0020 . 

0002 

-ebstt/jj to the value of XiC/j). 


2S, The Moments About a Fixed Point of the Sample Function in Terms 
of the Thiele Moments of the Universe- We return to the problem of section 
25, only we wish to express the results in terms of the Thiele momenta of 
the universe. We must use the formulae of section 12. , 



where p,- ^ 1. 

Thus will contribute to all partitions of r and inversely the contributions 
to a given partition are composed only of these terms which are obtained by 
combining the different elements of the partition. Since the numerical coeffi- 
cient in the expansion of |i, is the number of ways in which the r units can 
be collected to form the partition, it follows at once that the complete X coeffi- 
cient can be obtained by grouping the parts of the partition in all possible 
ways, determining the coefficient of each according to the methods of section 25, 
and adding. In, this way the formulae of section 21 can be used to give expan- 
sions iti terms of partition momenta. For example the representation of /^jf/a) 
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1 16 10 16 

0 4 3 2 

2 3 2 

2 

gives at once 

+ 15[bfln + 5«7i(7i — 1 )]X 4 X 2 + 10 [ 6 fln + b3j?i(7i — 1 )]X 3 

+ i5[bi,n + Bbiin(n — 1) + 6mn(w — 1)(« ~ 2)]\i. 

The partitions of section 21 can be made to give the formula Mi(^r) which 
were given by Thiele ( 1 , 45-46). For example the formula for is indi- 
cated by 

1 3 

4 2 

, 2 

. so that 

mIC/0 - + h2min - 1)]X2 

and since 

, (n — 1) (w* — 6 » + 6 ) j 1 “ 3 

64 s= 7 — - and & 2 i 7 — ■ 

, s (fi - 1) (n^ 6 n -h 6 )Xi 6 (n - l)x; 

^3 . 

which agrees with the result as given by him ( 1 , 45). 

The two way partitions of section 24 can be used similarly. This device 
for changing to tl^e X's is due to the ingenuity of R. A. Fisher who applied it to^ 
the case where /r = kr. 

As an illustration we write from section 24 the value of ii 2 (/ 2 ) in terms of \% 
The partition representation 

112 

22 20 11 
02 11 


gives at once 

b^nXt “h [bln 4* hzn{n — 1)]X2 4- 2|6s?jr 4~ h\in{n — 1)]X2 

which agrees with the result of section 12. The other illustrations of that 
section may be written out similarly. 

As a final illustration of this technique we find the coefficient of X® in the 
expansion of Msitifst /s). The partitions are 

2 9 IB 6 

301 220 211 310 

031 112 121 022 
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and the coefficient is 

2[hlhn -{- 63bLi’i(w 1)1 + sKM + hlihm(n “ 1)] 

“f 18(6a64U “h ” 1)1 6[63&2?4 4" h^bz^^n(fi ^ 1)]. 

If the b’s are inserted to form the h'», the firet and last terms become 0 and the 

others give This agrees with the value os given by B. A. Bisher 

nifi — 1)* 

(3, 208). 

29. The Moments of the Sample Function in Terms of the Thiele Momenta 
of the Universe, The partition representations of section 21 and section 24 
Can be used similarly to write formulae for the moments of the sample function 
in terms of the Thiele moments of the universe. It is only necessary to use tlie 
general plan of section 26j but to write the coefficient of every resulting parti- 
tion according to the method of section 28. For example the partition 



gives the coefficient 

bj(?T. -b -f 371^^^ -b — ibt[n +• 3n*(?i — 1) -b n.^(n — l)(n — 2)J 

+ ObfilTl* "b “^^(77 " D] — = hiln^ — 4ri^ 4* 6M'^ 3?i*] = 0. 

30. The Thiele Moments of the Sample Function in Terms of the Thiele. 
Momenta 'of the Universe, The partition representations of section 21 and 
section 24 can. also be interpreted to give the Thiele moments of the sample 
function in terms of the Thiele momenta of the univejse. The scheme is 
similar to that of section. 29 except that the formulae for changing to Thiele 

2000 

moments are used os in section 27. For example the partition 0200 has now 

0020 

0002 

associated with it 

bjfn. + -b 4- 4 ^ — dbjpi* 4~ 3n^(7J. -- 1) 4- n®(7i l)(n — 2)] 

- ^ 1 )' 4 - I 2 bs 1 )] - 6 bS?i* « 0 . 

The appUcal^ion of this method enables one to write the formulae of section 13 
(and others which they typify) with relative ease. It is now possible to com- 
plete the task left unfinished in section 15, We do not take the spa.ce necessary 
to write all the terms of > 2 i(/ 3 , /s) since the lengthy expression can he obtained 
quiffi readily from the representation of section 24. One term, say the coeffi- 
cient of X(j> 2 , is represented by 



MOMENTS OF ISOBARIC MOMENT FUNCTIONS 


57 


1 9 12 6 

330 222 321 312 

002 no on 020 

and gives 

9[&S&2«. -f- blihnin — 1)] + + 6 a 62 i&utt(^i — 1)1 

+ 6t6s6an + bihihin{n - 1)1 

which becomes when ha - h ^ and 1 >m = bji = rr. This 

n(n — 1) ti n(n - 1) 

agrees with the result given by H. A. Fisher (3, 209). 

For simplicity of form it is logical to use this formuUzation of results, Thiele 
moments in terms of Thiele moments, and it has been used by Thiele (1), 
Craig (2), Fisher (3) and Georgescu (4). They however have used different 
sample moment functions. Thiele and Georgescu used the Thiele moments 
of the sample, Craig and Georgescu the moments while Fisher introduced the 
fc function. 

The present di.scussion deals with the coiTesponding partition moments of 
any rational integral isobaric moment function of the sample. The results 
indicated here give many of the results of the previous authors as special cases. 
For example the symbolic formula 44 of section 24 gives the m\(fJ 4 ) of Thiele 
(1, 45), the n) of Craig (2, 67), the «(44) of R. A. Fisher (3, 210) as 
special cases when the formula 44 is given the interpretation of this section. 
Some may prefer the Craig attack (2, 21-35) to the partition method, It 
should be noted that the formulae of sections 21 and 24 can be used in place 
of part of the Craig method. Thus his formulae (2, 22) 

*'80 ~ ^80 “f" 28 ^6(1X20 “b 66 XboXso etc. 

Vn = X44 4" (12 XjjXo; “b 16 XaaXii) "b etc. 

are immediately obtainable from the symbolic formulae by writing X’s in place 
of 6's and by using row, rather than column, subscripts. It is then necessary 
to compvite the values of ... as given by him (2, 16-17, 40) and to insert 
in his expansions of Skiivmj rn) in terms of i^'s. For example 

Vi) =! - [I'M "b (w — l)f'a5 — tii'Mi'oil (2, 32) 

n 

and from the symbolic formulae of sections 21 and 24 

rfic — X30 4 " 10X30X20 

^32 = Xs 2 "b XsqXm “b 3X12X20 4 “ 6X21X11 

V 30 = Xao 

VbO = X 20 
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so that 

, vu) = ~ [V + (n ~ i)\n H" 9^M^s^o + (w — l)(6A?iXti + 3X2A?o)l (2, 30) 
71 

which agrees with that given by Prof. Craig (aside from an obviovis typographical 
error). The insertion of the values of A gives the value as indicated by 
Aii( 7 »a , 7 » 2 ) of section 13 and by the fii-st method of the present section. 

31. Special Rules for the Detenuiftatiofl of the Coefficients in the Case of 
the Fisher and Georgescu Analyses. R. A. Fisher (3) gave a number of simple 
rules which assist greatly in the determination of the coefficients accompanying 
the partitions, Georgescu (4) also introduced special rules for the evaluation 
of the coefficients of the different partitions he used. It is not to be expected 
that all these rules are applicable in the more general case under present con- 
sideration, but the vanishing of such coefficients as that of 2000 leads one to 

0200 

j ' 0020 

0002 

suspect that there might he some rules which are applicable to this general 
case. A sensible method of procedure is to examine the rules of Fisher and 
Georgescu and determine if they hold in the mote general analysis, The special 
rules of R. A. Fisher might be given somewhat as follows. 

A. If a partition has a column with a single entry, that column may be 
eliminated and the factor n"^ introduced. 

B. Any partition having a row with a single entry may be neglected, 

C. “We may exclude any partition in which any set of rows is connected 
to its complementary set by a single column- only,*' 

D. In determining the algebraic coefficient of a partition the “pattern” is 
sufficient and precise entries are not needed- Thus the partitions 21 and 35, 

11 42 

although they have different numerical factors, have associated with them the 
same function of n. This value is indicated by the pattern xx which has asso- 

XX 

ciated witli it the function -As a result of this property Fisher was able 

to provide a table (3, 223-226) of useful patterns which is of great assistance 
in writing the value of the coefficients. 

E. Formulae of moments of fc functions involving h can be derived from 
corresponding formulae not involving ku "The effect upon the corresponding 
formula of adding- a new unit part to the partition, is (1) to modify every 
term in the formula, by increasing the suffix of one of its k functions by unity 
in every possible way, and (2) to divide the whole by n.” (3, 206). 

Two of the important Georgescu rules may be stated. 

A^ , The numerator function (aside from numerical coefficient) is not altered 
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if column?! arc changed to rows and vice versa. Thus the coeflR,cieiit of fij in 

"iSa ' coefficient of $1 in <5(2^) is Georgescu 

tJV d- 1 } (iV “T-lr 

htus replaced nhy N + 1. 

B'. All partitions which can be broken up into component parts have coeffi- 
cients of 0. This is extended to include all partitions which have as component 
parts other partitions. Thus 


2100 

1100 

0012 

0034 

has a coefficient 0 as does the equivalent 

2010 

1010 

0102 

0304 


'32. Special Rules for the Determination of the Coefficients in the More 
General Case. In the more general case we have 

A. If a partition has a single column with a single entry, c, that column 
may be eiiminated and the value b* inserted as a multiplier. This is imme- 
diately evident since the contribution of that column to each term in the 
expansion is be times its value if the column were eliminated. 

B, The coefficient of any partition having an entry which is the only entry 
in its row and column, is 0. 

This rule, which saves considerable labor in that it makes unnecessary the 
computation of the coefficients of many of the partitions of section 24, is estab- 
lished in this way. Without loss of generality the partition may be repre- 
sented by 

Cii Ci2 Ci3 * • • Cit) 0 

CJI C 22 023 • ■ • few 0 

ITtt+l.fl+l = fel C32 C33 • ■ » fev 0 


Cui Ch2 Cu 3 • " ' Ouw 0 

' t 

0 0 0 0 t 

and 7 r„ may represent the partition containing the first u rows and the first v 
columns. We determine the coefficient of 7r«+i,fl+i in terms of the coefficient 
of TTu . V- Consider first any grouping of the u rows of ti. ^ ^ into w rows. There 
will be v> corresponding groupings of in which the last row is added, in 

turn, to each of the w rows and another w -f- 1 rowed term in which it is not 
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added. In each of the first w cases the coefficient by rule A is multiplied by 
111 case of the w -j- 1 rowed partition the coefficient is multi- 
plied by and is ret)laced by A final adjustment takes 

care of the transition from the moment about a fixed point of the sample 
function to the Thiele moment of the sample function. This adjustment de- 
mand.s the multiplication of the coefficient of x« by n and the sub- 

traction from the sum of the other terms* If Bn, is the coefficient of the w 
rowed form, it follows at once that the corresponding coefficient is 

[wn'-> + - Bn'"’] = 0. 

This holds for the expansion of any termi of x„ , v and hence the coefficient of 
TTu+i.iif-i is 0. Of course the argument holds if the partition has more than 2 
component parts. 

It thus appears that this rule holds not only for kr and Wr as Fisher and 
Georgescu have noted, but for/r. 

C. The coefficient of any partition which can be broken into component 
parts is 0* In this sense a component part is any group of rows or columns 
which have no entry in common with any other group of rows or columns, 
It corresponds in matrix language to a matrix which results when one matrix 
is zero bordered by another matrix although rows and columns may thereafter 
be interchanged. 

The proof of this more general case follows the general line of the simpler 
case although the reasoning is more complicated. For example the coefficient of ' 


Cii 

C |2 • • • 

Cjv 

0 

0 

C 2 I 

C 21 • • • 

C 2 ti 

0 

0 

C 31 

C 32 • ‘ • 

C 311 

0 

0 

Cui 

c,a ‘ • • 


0 

0 

0 

0 

0 

Cu-f 1 , v+l 

Cn+1 , 13-1-2 

0 

0 * * * 

0 

, v+l 

Cw+2 , v-f-2 

is 0 since any w rowed term of the x« . „ contributes 


V+^+^CI-H r 

^ + 2 

Iwn'”' + n 

- mb' 


“h I v+i*u+i ■ B+i , n+jeu^.! , [■w(w 1) 71^ ^ -j- ^ "h 

— n{n ^ l) = 0. 

Other special rules of Fisher and Georgescu do not hold in the general case. 
Thus Fisher rule B is not generally true since* the partitions 

12 and 22 

30 20 



MOMENTS OF ISOBABIC MOMENT FUNCTIONS 


61 


have respective algebraic coefficients of bibin -J- buhnin — 1) and 

hihn + bsihzn^n — 1) 

and these are not in general equal to 0. 

The Tisher rule C is replaced by the somewhat less general C of the present 
section. 

The Fisher rule D is not applicable in the general case. Tlie Fisher rule D 
is applicable in all cases in which the value of the fepfi ... j,p is completely deter- 
mined by the number of part.s for in tbi.s case, the particular value of each 
part is not pertinent. We may say then that the Fisher rule D is applicable 
to all cases in which ... ji> is a function of jo, n \YhGTe p is the number 


of parts. Thi.s condition is sativsfied by bpi 


_(-ir(p^i-)! 

^ ^ n .* ^ y 


and the 


^(p) 

coefficients arc worked out for it in FisheFs paper. The same method is 
applicable to other functions satisfying the general condition although the 
values of the coclficients will of course vary with the definition of b. 

The Fisher rule E is not applicable to the general case. Its validity, from 
an algebraic standpoint, depends upon the Fisher property B which is not 
generally applicable, The Fisher rule E as applied to the more general case 
gives correct terms but it does not give all the terms. For example the Fisher 
rule E applied to givc.s 

Tlho application of a corresponding rule to 

hif?) = b2w\4 -h 2(b2'B bjitt(n - I)]X2 

would give 

k2t(/2,/i) “ babiaXg ilbsbiB -h bu6iu(n ”• l)lXaXj 
while the correct result is indicated by 


1 

221 


4 

310 

Oil 


2 

201 

020 


4 

111 

no 


and is 


X2i(/2/i) “ babi^jXB -b 4[b2bi?i 4" b2biibi?t(ft— 1)]X3X2 -b 2lbabi?t- 4- b\bin{n l)iX3Xj 

4” 4[b2bi?r 4" bubiTifn ■ — 1)]X3X2. 
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The difference is due to the vanishing of the two niiddle terms in the case of 
the k functions. 

The rule B', wliich Georgesou found most useful in computing and checking 
his formulae, is. not generally true. It is not even true in the case of the k 
function, as can be discovered by using it on the list given by R. A. Fisher 
(3, 210). It is interesting to note that the Georgescu method, while not being 
able to utilize many of the special rules of tlie Fisher method, does use this rule 
whicli is not in general adaptable to the Fisher method. 

33, Special Rules in the Case of the h! Functions. Special rules can be 
worked out for other sample functions. As an illustration we examine the 

function hf which was defined in section 19. It is recalled that hyp = ~ and 

that 6pp'... p'l = 0 for all other eases. It follows at once that 
A. Any partition having any entry other than unity (or zero) may be 
neglected. 

, B. The value of hiP is i. 

As an illustration we write the value ha). From the partitions of 

section 24 we select 


36 


36 

111 


no 

111 

and 

no 

no 


101 



on 


as being the only partitions making a contribution. The result of section 19 
follows at once. 

34. The Case of a Normal Universe. A normal universe is characterized by 
the relationship that Xr = 0 when r > 2, It follows that it is only necessary 
to compute the coefficients of those partitions giving powers of X 2 . 

Wishart (6) (7) has developed the partition analysis of the fc function in 
the case of a normal parent while Georgescu has studied the corresponding 
m function. It is not the purpose of this section to make extensive study of 
the case of the normal parent but simply to indicate that the results of section 24 
are immediately applicable. As an illustration we write the values of Xi(/ 2 ), 
and Ad/a) in the case of a normal univer.se. The terms are given 
successively, by 


1 

2 

8 

48 

2 

11 

no 

1100 


u 

. on 

0110 



101 

oon 




1001 
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and hence • 

^i(/2) === bswXs 

X3(/2) = 2[bln + b\Mn - ml 

X3(/2) = 8[i>2W "f" “1)4- " l)(jl — 2)]X2 

X4(/2) “ 48[&27t -f- &blhiin{n — 1) 4" hiin(n — 1) 4“ 462biiJt-(?i — l)(w — 2) 

+ 2&)in(w,“ l)(n - 2) + blnin - l)(7i - 2)in - 3)lXi. 

It 18 only necessary to substitute the 6'a to obtain the results for different Values 
of f. This is done in Tabic IV. 

' TABLE IV 


The Jirsi four Thiele moments of /a for various sample functions in ike case of a 

normal universe 


Sample 

func- 

tion 

Mh) 

1 

1 

Xst/i) 

Xd/s) 

ms 

X, 

n 

' 2(»-1),5 
, ^ 

8(n - 1) 

48(n - 1) Xj 

71'* 


Xa 

1 2XS 

8X2 

48X2 

1 n — 1 

(n - 1)2 

(n - ly 

h 

n 

2(»-1),2 
, As 

1 ^ 

8(n “ 1)X2 

1 

48(?i“1)X2 

1 

m[ 

^2 

2\l 

n 

84 

n® 

48\i 

hs 


2\l 

8X2 

48X5 

A2 

71—1 

(» - 1). 

(71 “ 1)® 

' hi 

0 

2X1 

8{n - 2)X2 

48(71^ “ 371 + 3)X2 


7i(n — 1) 

n^in - ly 

n\n — 1)® 


One surmises that the general value of 

Xr(/2) is 2""'(r - 1)! X5B: 11000 • ■ . 0 

01100 • . . 0 
00110 ... 0 

00000 ■ ■ ‘ U 
10000 ... 01 
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wh^re B represents the i coeffi-cLent of the y rowed partition, 
appears consistent with the fact that 


^r+l(^2) 


2V ! 

(n ~ ly 


This induction 


as shown by John Wishart (7). The whole subject of the Thiele jnomenta of 
the general function in the case of a normal universe would make an interesting 
subject of investigation. 


35. Summary and Conclusion. The contributions of this paper include 

1. The definitions of specific moment functions in terms of power sums. 

2. The use of iiidetermiivate multipliers iii representing a general isobar ic 
moment function. 

3. The finding of the expected value of products of these functions by alge- 
braic methods. 

4. The use of tables in writing these expected values in terms of moments 
(or of moments about a fixed point) of the universe. 

5. The finding of the expected values of specific moment functions by sub- 
stitution. 

6. Means of establishing the expansion of new moment functions which are 
defined by their expected values. 

7. The introduction of the sample function of weight r whose expected 
value is ji,. 

8. The introduction of the sample function of weight r whose expected 
value is 

9. The two way partition formulae of weight g 8 which do not involve 
unit parts. 

The use of tlicsc partition formulae in writing: 

10. The moments about a fixed point of ft in terms of moments. 

11. The moments of fr in terms of moments. 

12. The Tliiele momenta of fr in terms of moments. 

13. The moments about a fixed point of fr in terms of Thiele moments. 

14. The moments of fr in terms of Thiele moments. 

15. The Thiele momenta of fr in terms of Thiele moments. 

16. Special rules in the case of Thiele moratots. 

17. The applicability of these results to a given sample moment function 
and hence the derivation of varied results, of such authors as Thiele, Tchouproff, 
Church, Fisher, Craig, and Georgescu, from the same partition formulae. 

18. The simplicity of the formulae when hr is used os the sample function. 

19. The application of the synthetic formulae to the Craig method, 

20. The applicability of the theory to a normal universe. 

The introduction of such general procedure opens up a wide field for future 
study, It is impossible in a single paper dealing with so broad a subject to do 
more than to outline the general scheme by which two way partitions can be 
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used fls a central formulizatioii'of the various formulae for moments of moments. 
More detailed proofs and more extensive analysis of the more important of the 
special cases will undoubtedly be supplied by later writers. 

In later papers the author will show how the partition representation can 
be used in the case of multivariate distributions and how it can also be used, 
in connection with the sampling polynomials introduced by H. C. Carver (11), 
to represent the more complex formulae obtained in the case of finite sampling. 

It is obvious that the author is indebted to the classical moment studies of 
Fisher and Craig. He also wishes to acknowledge his indebtedness to Prof. 
Craig and to Prof. Carver who have read tho manuscript and have made 
valuable suggestions. 

Tub Univehsity of Michigan. 
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NOTES 


A COEFFICIENT OF CORRELATION BETWEEN SCHOLARSHIP 

AND SALARIES 

INTnODTJCTION 

t 

Some might clmibt that it is correct to apply a coefRcient of correlation to 
show the relationship between scholarship and salaries. This coefficient can 
be trusted to give at least a rough approximation, which is all that is necessary 
in the inexact science of vocation, It is fictitious accuracy to be too finical 
in the application of formulas. Therefore, a coefficient of correlation between 
scholarship and salaries is a valuable part of human knowledge. 

Would it be worth while to find this coefficient if it is based upon the experi- 
ence of the American Telegraph and Telephone Company? Since the employ- 
ment practices of this company are not representative of the employment 
practices of business at large, one might doubt the validity of drawing general 
conclusions from such specialized data. The coefficient for business at large 
is probably less than the coefficient for the Bell System; the value of this knowl- 
edge is enhanced if we know the latter coefficient. Since this company is very 
large, a coefficient between scholarship and salaries would be, valuable, even if 
this coefficient applies only to the Bell System and to other companies ^having 
approximately the same employment practices. 

An article^ by Mr. Walter S. Oifford, President of the Bell System, contains 
a discussion of some of the relationships between scholarship and salaries. 
President Gifiord, however, did not determine in the case of the Bell System a 
coefficient of correlation between scholarship and salaries. 

The purpose of this article is not a new contribution to statistical method, 
but is an application of the method^ of finding the coefficient of correlation 
when the two variables have not been quantitatively measured. This method 
will be applied to the chart on, page 672 of President Gifford’s article, in order 
to determine for the Bell System the coefficient of correlation between scholar- 
ship and salaries. 


FINDING THE COEFPICUSNT OF CORRELATION 

An explanation of the chart. It is based on the experience of 2,144 Bell 
System employees over five years out of college. First, assume these employees 


Ut ii entitled ^‘Does Busineas Want Scholars?" and was printed in the May 1928 isaiie 
ot Harper's Magaiine, 

^ It can be found in Elderlon'a “Frequency Curves and Correlation. " 
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are grouped according to their grades in college. In the high scholarship group 
put those who graduated in the highest third of their classes. The middle 
and low scholarship groups are formed in like manner. Secondly, suppose the 
same employees are divided into three equal groups according to their salaries. 
Then, the salary of any one of the employees would be high, middle, or low. 

Assume a liypothetieal group of 300 employees rvho are college graduates. 
Suppose that the scholarship of 100 of them was high, that the scholarship of 
100 of them was middle, and that the scholarship of the others was low. Also 
assume that the salary experience of these 300 employees is .the same as that 
of the 2,144 employees of the Bell System. 

The 300 employees can be grouped according to the following tabic. 


TABLE NO. 1 


Salary 

Scholarship 

Totals 

Low 

Middle 

High 

High 

22 

24 

48 

94 

Middle 

31 

39 

27 

97 

Low 

47 

37 

25 

109 

Totals 

100 

100 

— 

100 

300 


This table can be combined as follows. 


TABLE NO. 2 
% 



1 

1 Scholarship 

Salary 




Low 4; Middle 

High 

High 

C 

d 

Middle & Low 

1 

a 1 

b 


Then, c = 46, a = 154, d = 48, and 6 = 52. Assume N = 300. 

Assume a: is a function of grades received in college. Suppose y is a function 
of salaries received. Assume that the frequencies x and y both follow the 
normal curve of error whose standard deviation is equal to one. Also assume 
that the average of x and the average of y are both equal to zero. It is a 
matter of common knowledge that salaries are not arranged in a symmetrical 
fashion; y is not a linear function of salaries. 

In the formulas which follow, ?■ is the symbol for the coefficient of correlation, 
These formulas are applied to Tabic No. 2. We have 

— ^ f dx = ^ = .167, and h = .4316. 
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Also 


— r 

y/%: 


,-*»■ iy = = .187, and h = .4874. 


217 


Then, 


H =. 


^ = .3635, and JC = -i e 


,3543, 


All the c[uaTitities except r in the following approximate equation are known; 




+ ^ “ 3) + (A* — + 3) + 3). 

24 -1*0 


Therefore, 

.0261?® + .0681r* + .1034/ + .1062?^ + r - .4314 « 0. 

Then, r is approximately equal fco ,4061. Consequently, for practical purposes 
we can assume that r = .4. 


28 Doodt Street 
B ntrNswrcs:, Maine 


John L. Roeehtb 


NOTE ON THE DERIVATION OF THE MITLTIPLE CORRELATION 

COEFFICIENT 

Consider N observed vahms of each of n variables. These n^N values may 
be tabulated in a double-entry table as follows; 

Xxi Xi2 Xi3 - ■ Xi^ 

X„ Xn X 23 • * • Xay 


X„t Xna Xna * • ■ Xfiit 

where is the A:*** value of the variable. 

Using the variable as the dependent variable, the general linear relation- 
ship between the n variables may be expressed by 

Xi = ,'^1 ail “b jda ■ 4* 2!t-i “b iac+i "b * • * 4“ idn (1) 

where 


idf is the general parameter which is to be determined empirically; 

Xf~Xj Jlf j ; 

Af; is the aribhmetie mean of the variable. 
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By tlic method of least squares, the constants of (1) must satisfy the normal 
equations: 

(S®5)iai + (Sa:ia:2)<aa + • ■ • + 

+ (2«ia;i-n)iCti+i + •‘’ + - 2)a:ia;< 

(Sa;2a:i),'ai + i'2xl)iai + • * • + (Sa:2a:<^0<«f-i 


d" ( 1^2 d" ■ ’ ’ d^ (SSii— iSJn) <^71 ■“ — \Xi 

{'2Xi+iXi)iai d- (Sa;,>+ia:3()ia2 -f . . . -f (Sa:,'+ia;„)<a„ = Saj^+i®,- 


(■23Jn3'l)i^^l d“ C^®n3'2)ia2 “f" ' ' ' d~ 

where 

(&/*/) = Z - Ml). 

it"! 

But 

(Sa:,®,) = JVri/fffO-,-, / 

(Sa:-) ^ = Nfuom (2) 

where 

is the Pearsonian coefficient of correlation between the and j“* variables, 
ai , the standard deviation of the variable. 

Substituting the right members of (2) in the normal equations, we obtain 
the system : 

n 

2 r\k<rm iOjt = 0 
fc=i 


S ^2JtV2irjfc fOc = 0 

fc"i 


^ r,_i , k (fi-iffk iO>k = 0 (3) 

A-i 

1 

n 

2 ’'«+» . ** = 0 
*“1 


n 

^ j {dh = 0 
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where 


Let 


If 

-1. 

riitriu, 

■ • Tn\<rn(^\ 

h « • » < • 



( 4 ) 


Aii be the first minor of the element ri/cr.-tf/ in A, aA be A with the and 
columns interchanged, and be the first minor of the element in the 
column and row of itjl, 

Solving (3) for idk by Cramer's rule, we find 


A 


But it can easily be proved that 


«4« = 


hence 


= t-1; -T-, 

An 

Using eofactors of A instead of minors, we have 

,a,.(-l) 

f 

Without writing the determinant out in full, we notice that the (t’s can be 
factored out* Hence , 


,ajt = “ 




2 2 


2 2 


Kik 






<y\ K„ 




(5) 


where 



I 
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Using these derived values for the coefficients, we may write (1) in the sym- 
metric form: 

(X, - M ,) + (Xs - Af,) + • . . + (X. - M.) = 0, ■ 

ffi (72 . 0-„ 

or 



For a multiple correlation coefficient, wc use the formula 

2^ I 2mi “h ' ^ iOik^k) ) I 

_ -I t°i L V°i^ ^"<+1 / J 

X A ■ 


r; 




(6) 


which measures the amount of observed dispersion from the regression plane 
in which X,- is the dependent variable. 

Substituting the values for the a's, we find 


El^l- 


tf , 

(KixTi 

w 


l®!,’ . KiiXij . 

H : r 


(Tj 


+ 


Ki„xA ' 

ffrt / 


KhN 


Squaring the bracket expression and using (2) we obtain 



The second sum is the sum of the products of the elements in the row 
by the cofactors of the elements in the row. This sum is necessarily zero 
unless h = i] but if fc = i, this sum is equal to K, 


B' = 1 - {KaK) = 1 


K 



Oreoon State A.GRTcni/ivnB CoiiUege 
School of Science 
C oKVALLie, Oregon 


1 


William J. Kirkkam 
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NOTE ON NUMERICAL EVALUATION OF DOUBLE SERIES^ 


1. The Eulei'Maclaurin summation formula has been extended to two 
variables by Dr. Sheppard,® and Mr, Jrwin,® to determine cubature formulas. 
A more complicated two-dimensional form was given by Baten* involving 
product polynomials, for which a remainder term was also calculated. The 
purpose of this note is to apply the simpler formula to the numerical evaluation 
of double series of positive terms. The method may be extended to multiple 
series of order p > 2. If the double series converges one may sum by rows 
(or columns), using the ordinary sum formula twice. The method is to take 
out a rectangular block of mn terms and then apply the formula to the remaining 
terms. By taking v(i and n sufficiently large one may cause tlie series resulting 
from the formula to converge sufficiently rapidly to obtain the sum to the' 
desired number of decimal places. For practical work the ^rror may be es- 
timated because of the asymptotic character of the series involved in the Euler- 
Maclaurin formula. 

Write this in the form 


( 1 ) 


%m = H- 4/to - 4/w - + 0“^ - 

/''(») - /"W , r%) ^ ru) , , n. „ . 

30240 + 1209600 ’ ’ (2?)T 

If 5 CO one has accordingly in the ordinary case of convergence 

8) I ™ m ^ „ 

Now define r(a;) = 2 y) - f u(x, y) dy + ^u{x, h) - + 

y-b Jf, 12 

^^ ' 720 - 2 = j y )^^ + y ) — — + 

2/) 


oO cq 


^ fl— I 6^1 ^ ij“l 

720 ~ ' * ' > then. y) = u{x^ y) -j- 'oi^ -f- 


(3) 


v(^)<h + it(i) - !!W + 


-f 

- MB + Wi) + + . . . 


„V/i \ ^ 

30240 ‘ S S 


* Presented to the Society, Nov. 30, 1934. 

* W. F. Sheppard, "Some Quadrature Formulae," Proc, London Math. Soc.. Vol xxxii, 

1900. ’ 

» J. 0. Irwin, "Tracts for Computers,” No. X, Cambridge Univ. Preas, 1923, On Quad- 
rature and Cubature. 

' W, D. Eaten, "A Remainder for the Euler-Maclaurin summation formula in two 
independent variables,” Amor. Journal of Math., Vol. 54, 1932, pp. 266-275. 
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Il-’l b-l » n-1 

Instead of this one may use ^ ^ u{x, y) + ^ w{y) + ^ v{x}. The scheme 

X=al l/B-l Z^l 

of the double series may bo illustrated by a sketch of a quadrant of the asy-plane 
in which the point (tc, y) represents the term u{Xj y). 

Evidently by taking a combination of results from (3) one may evaluate 

quite readily such finite sums as 2 V) where q and i arc large. 

y=ir 

As an illustration of (3) consider 2 S Here one needs to 

evaluate the integral of the summand. The transformations x = .ay tan 6 
and y ~ Ijt lead to a form which may be integrated by parts. The more 
complicated form 2 2 + 26a;7/ + cy^y* for the case in which s > 3/2, 

a > 1, might be handled by using x = 1/t and approximate integration by 
Simpson’s rule. 

Take as a second example 2 2 (^ + P > 2. The case of p = 4 was 
carried out by taking a = t = 10 in (3) and carrying the computation to twelve 
decimals. The series involved converge rapidly and a result was obtained 
which differed by 2 in the 12th place from the true value 0.119 733 669 448'*'. 

By summing diagonally one may convert this to the simple series 2 + 1)'^ 

I 

00 00 

or 2 (s — = S (s”* — The method of summation diagonally may 

S I 

be extended to 2 + cty)~^} p > 2, a > 0) by the applications of the 

Euler-Maclaurin sum formulas (1), (2) in succession after a triangular array 
of terms have been omitted. 

The form 2 S can be written as the product of the single series 

2. Another method of numerical evaluation is the analog of that used for 
single series by the author.® Instead of rectangles one has right prisms of 
square or rectangular cross-section. Instead of shifting the rectangles one unit 
to the right to detennine upper and lower bounds the prisms are shifted diago- 
nally so that they go effectively one unit in each variable. In the case of a square 
base each prism is moved along the 46'’ line one diagonal unit length. For 
the lower bound instead of trapezoids one uses truncated prisms. For example, 
the prism of Height Wmn is cut by two planes, one determined by the upper 
vertices u„n, and the other by the upper vertices 

Wm+i, n+i of the truncated prism. The surface z «(?«, n) passes through 
all the upper corners of the truncated prisms. Each prism is composed of 
two truncated triangular prisms. Now the volume of such a triangular prism is 
the arithmetic mean of its vertical edges multiplied by the area of its base. 


New Method for Finding the Numerical Sum of an Infinite Series," Araer. Math. 
Monthly, vol, XL, No. 9} Nov., 1933, pp. 637-642. 
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Hence the difference in volume between the truncated rectangular prism men- 
tioned above and the prism of uniform hight z = u^n can be shown to be 


(4) 


"Wm-i-l, u+1 n 2 Wbi, ft+l)/6‘ 


Let US consider series whose corresponding surfaces do not rise above these 
truncated prisms. This sort of truncated prism differs less from the volume 
under the surface than the one formed by the diagonal joining the other pair 
of upper vertices and planes through it for upper faces. The lower bound for 
the remainder is the volume under the surface extending to infinity in the 
?n and n directions plus the sum of these differences. Accordingly one deter- 
mines as the lo.wer bound for the remainder Rm-i. n-i after summing a rec- 

fn—1 n— 1 

tangulnr array X) “f,; the form 




(fi) 


(2u,H,l 4" ~ 5Wm,n)/6 “b a S Wm+i,! "b I ^ 'Wl,n+/ “b h ^ Wf,>i 

7-1 1-1 


t-1 


n put ■ pto put 

+ i£«m., -b / I Um.ndmdrt-b / / Um.ndrndn < 

j”l Jl Jm Jn Jl 

The upper bound may likewise be given as follows: 

n » r ” *1 

'Ufn,ndvid7l " h\ 2^ Vfjn—lfi 
1-1 I i-m J 


where 

(7) 


( 8 ) k = 


“ f»-l » m-l 

“ £ S , 2" = £ £ wi77 

itmjn j-i-i 1-1 

r* f” p» n »■ 

/ / V‘m,ndwln “ / / dTndtl Wm— 1,7 ^ j 

^ ^TTi— I Jj\ Jjf\ ^ai»TV — 1 


'J^ni-1,11-1 "b 'Ww.ii.-l 


^n alternate definition of k is 


(9) 


^ ^Um,ndmd7l — 1 Wm.n)* 


•c 00 


An illustration is afforded by £ £ (m + 1)“* for which h = ,46614, 

rt“l 

V — ,44586 when m — n = 10 in (8), (9). In this case (5) gave an error of 
"14 X 10“® and (6) an error of 10~^. 

S and T may be evaluated by the method published in the Monthly.® 

One must assume that k increases with m and n. It is evident that for this 


• Loc, cit, 
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method and for the one in the Monthly differentiability is not required but 
only integrability, conditions less restrictive than those required by the Euler- 
Maclaurin summation formulas. It- ia also clear that the method may be 
extended to multiple series of positive terms of multiplicity greater than tyro. 

DsPAIlTMEiST OF MATHEMATICS CHESTER C. CaMP 

University op Nebraska 
Lincoln, Nebraska 
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REPORT OF THE ANNUAL MEETING OF THE INSTITUTE OF 
MATHEMATICAL STATISTICS 

The meeting of the Institute of Mathematical Statistics for 1936 was held in 
Chicago on December 28-30 in connection with the meetings of the American 
Statistical Association and the Econometric Society, 

In addition to the sessions at which voluntary papers were read, a session with 
invited papers was held on the morning of December 30. At the invitation of 
the Program Committee, Professor P. R. Rider presented a paper on "Recent 
Advances in Mathematical Statistics: Factorial Design" and Professor Harold 
Hotelling spoke on "The Analysis of Sets of Correlated Variates.” 

Professor C. C. Craig of the University of Michigan and Professor A. R. Cra- 
thorne of the University of Illinois constituted the Prograni Committee. 

At the business meeting of the Institute, the following officers were elected 
for the year 1937: President, Dr. W. A. Shewhart; Vice-Presidents, Professors 
P. R. Rider and B. H. Camp ; Secretary-Treasurer, Professor A. T. Craig, 

The Institute voted that it would presumably hold its 1037 meeting with the 
American Mathematical Society, 

Allen T. Ceaig, 
Secretary. 


NOTICE TO SUBSCRIBERS 

Plans are under way to include in the Annals a new section, entitled “Numer- 
ical Illustrations of Statistical Methodology.” This new section will be a 
regular feature of the Annals, and will deal with the application of statistical 
technique and theory to the solution of problems in various fields. It is hoped 
that this new section will be of considerable value to those who are primarily^ 
interested in numerical applications of the more recent theoretical developments 
in mathematical statistics. 

The Editor will welcome contributions to this new section of the Annals, 
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REGRESSION AND CORRELATION EVALUATED BY 
A METHOD OF PARTIAL SUMS 

By Felix Bbenstein 

“To be 8ure, Laplace viewed the matter in a aimilar way but he selected the 
absolute value of the error bb a meaeure of loss. But if we miebake not, this 
position is certainly not le&s arbitrary than our own; thatis to gay, whether the 
double error is to be considered just as tolerable as, or worse than, the Bimple 
error twice repeated and whether it is thus more fitting to ascribe to the double 
error only a double weight, or a greater one, is a question which is neither in 
itself clear nor determinable by maiheTnatioal proof but ban to be left entirely 
to individual discretion. 

“Furthermore, it cannot be denied that the assumption under discussion 
violates the principle of continuity and precisely for this reason the piooodure 
based on it strongly defies analytic treatment while the results to which our 
principle leads have the advantage of simplicity as well as of generality." — 

F, Q, 0auaa: Theoria eombinaiioMs obsenalionum, pats prior, art, 0. 

Since the “Theoria Combinationis” of C. F. Gauss appeared in the year 1821 
a century of Mathematical Statistics has been dominated by the ideas of this 
classical treatise — ^ideaa whose fertility does not seem to be exhausted even 
today. ■ 

The germ of moat modern contributions to mathematical statistics—in fact 
also those of Karl Pearson and his school — go back decidedly to this paper. 
Though the immediate achievements of Gauss are so conspicuous as not to 
need any comment, a true critical appreciation of the work can be gained only 
by comparing it with the previous methods of Laplace, superseded by those of 
Gauss. 

For such critical appreciation, C. F. Gauss himself has prepared the ground 
in the lines quoted at the beginning of this article. To Gauss the standard 
deviation is a measure of uncertainty or risk of a game in which the errors of 
observation are considered as causing only losses. In this he follows the lead 
of his great predecessor. The difference between them is that Gauss adopts 
the square of the error as a measure of the loss while Laplace adopts its absolute 
value for this purpose. Either choice frees the error from its sign so that the 
loss is the same regardless of the sign of the error. 

Gauss considers this choice of the measure of the loss as purely conventional. 
.Therefore he feels justified in adopting the square of the error because in adopt- 
ing the square instead of the absolute value of the error, the mathematics he 
uses remains in the easily accessible domain of analytical processes, This 
creates for these methods a superiority in elegance, simplioity, and generality, 

The modern developments of mathematical statistics, based on the principles 

IT 
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of Gauss, have confirmed the correctness of this viewpoint. This has proved 
true particularly in tlie theory of analysis of variance developed by R. A. Fisher 
and in the more general theory of semi-invariants, first defined by N. H, Thiele. 

The inadequacy of the Gaussian method seriously impairing its value for 
statistical use has come to light through the investigations of Karl Pearson of 
distributions of one and two variables. Since the moments of higher order 
involve standard deviations of increasing magnitude the characterization of the 
distributions by means of the moments, in lino with the Gauss-Thiele concepts, 
becomes practically impossible. Therefore it was of the greatest interest that 
Lindeberg was able to derive an expression for the standard deviation of a 
measure of skewness constructed not on Gaussian but on Laplacian lines, 
namely based exclusively upon the sign of the error. The mathematical diffi- 
culties surmounted by Lindeberg by a very involved and difficult analysis — 
with some clearly indicated gaps in the proofs — are precisely of the character 
of those that Gauss wished to avoid. Encouraged by the success of Lindeberg, 
I have developed in two papers^ the standard deviations of more general mo- 
ments and the correlations between them of which the mean deviation of Laplace 
and Lindeberg’s measure of skewness are special cases. The proofs have been 
arrived at by a rather simple and rigorous procedure. These new moments, 
together with the old ones, form a new system of statistical characteristics by 
which a distribution in one or two variables can be described by expressions 
of lower order and therefore of greater precision. This method makes un- 
necessary the use of moments of higher order than the third. 

But another point of interest is still involved. It has been assumed that the 
Gaussian characteristics give a greater amount of information than those of 
Laplace. This is proved, however, only for the cose of the normal distribution 
ft 1 1 

recognized by Gauss himself in his paper of April, 1816, 
V 

that appeared five years earlier than the Theoria Combinationia Observationum. 
In article 6 of his paper, he says, that the constant of a normal distribution 
obtained from one hundred observations by the use of ■ the standard error is 
as exact as that obtained from one hundred fourteen observations in which 
the mean deviation is used. Hence with a given number of observations only 
the equivalent of 88% of the total are used by the second method. This does 
not hold true for all distributions. The following theorem can easily be proved : 
The amount of information as defined above, furnished by the use of the mean 
deviation is greater, equal to, or less than that furnished by the standard devi- 
ation, depending respectively upon whether 


• Felix Bernstein: *^Die mittleren Pehlerquadrate und Korrolationen der Potenzmo- 
mente nnd ihre Anwendung auf Funktionen der Potenzraomentc,*’ Metron, Vol. X, N. 3 
(Nov. 1933). 

Felix Bernstein: “Uber don mittleren Fehler der Potenzmomonte.^* Zeitschr, f. di ges. 
Vera.-WiBBensohjift, Bard 30, Heft 3, March 1030. 
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052 - 1 ) I 4 (ft - 1 ) 

where 


6o = 


}ii 

¥ 




B 

a 

JW2 


I 


jufc the fc-th moment and i> = the mean deviation. 


For example, in the distribution ~ e ^1®', the mean deviaitiou furnishes a greater 


amount of information than t^he standard deviation.^ 

In the present paper, we shall discuss the practical use of expressions for 
correlation and regression in which the new type of statistics formed along 
Laplacian lines will be used. These new expressions are of a linear form and 
can be computed therefore more easily than those of Karl Pearson, The amount 
of information given by these expressions is less than that given by the expres- 
sions of Pearson if the normal law, in two variables, is fulfilled. For other 
distributions, however, this is not generally true. The determination of the 
standard deviations of these new expressions is given in Metron,^ 

The application of the new expressions of regression and correlation to grouped 
data is set forth here for the first time. The method is strongly recommended 
for all cases in which the data lose reliability with increasing deviations from 
the mean. Deviations in the new method enter the expressions only in the 
first degree and not in the second as in the case of Pearson's. It is obvious 
that the influence of the doubtful extreme readings is, therefore, considerably 
lessened! Since our expressions are linear, no adjustments for grouping (Shep- 
pard’s corrections) are necessary. 

It ought to be mentioned here that linear expressions for the measurement 
of correlation have been set up before. 

K. Pearson (Biometrika) and Egon Pearson (Biometrika) have derived an 
expression called “linear correlation ratio” which in case of linear regression is 
identical with the correlation coefficient, 

K. Pearson also discusses the linear correlation coefficient 


r - 


1 / frysg^ , 

2\ xsgx'^ ysgy)* * * 


* To this second type of distribution curves also belongs y = <;-(») where »(a!) is the mean 

of two Gaussian curves with the same origin, i.e, ^(a;) = «“■'**** + 

\V T V ’T / 

1.6 < fc < 3.4. 

I owe this remark and some other valuable suggestions regarding the subject of this 
paper to Mr, Myron Fuchs. 

* Op. cit> 
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suggested by Lenz and various other linear expressions, all similar to our expres- 
sion (1). He finds that they are all equal to his quadratic correlation coefficient 
in the case of a Gaussian distribution. 

However, their expressions were not recommended by those authors for the 
determination of correlation between quantitative variables, because — 

1. No easy and practicable methods were given for their evaluation in the 
cose of grouped data. 

2. Their standard deviations were not determined. 

We now proceed to define the new formulas and to describe the methods for 
their evaluation. The proofs are furnished in the Appendix to this paper. 


Let ri and r% denote the regression coefficients oixony and y on x respectively, 
and f, as usual, the coefficient of correlation, and by £ and y the arithmetic 
means of the a:’s and y*8. Let us take x, $ as the origin, so that x, y are the 
deviations from the mean. We have 


( 1 ) 


fi 


Ti 


Sx 

+y 

Sy 

+y 

Sy 


or fi — 


or n 


Sz 
+x 

^ «= Vri X r 2 


Sx 

-V 

Sy 

-y 

Sy 

— X 

~~§x 

— X 


Sz denotes a partial sum of the x’s, this sum being extended over all the x’s 

of the observations whose y is positive and the other sums have a corresponding 
meaning. 

It should be noted though that if data occur whose ^/-deviation is 0 (practically 
never in a grouped table) one-half of the sum of these x'a should be added to Sz. 

. . . . . 

In the S a similar addition should be made in case observations occur in which z 
is zero. (See Table IV.) 

The formulas (1) and all following ones will be proved in the appendix to this 
article .■* 


* Using T\ and ri of (1) tho rcgresBion lines are y - T 2 X and x ^ fiy. They are thoee 
straight lines which fit the data best according to the method of least squaree, if the weight 
of the deviations is taken inversely proportional to the absolute value of the variable. 
Taking x for instanoo as the independent variable, r* is the value of m which minimizes 

(v — w*)’ (tho sum exlonded over all data x y). 
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The standard deviations of and ra are 


a 


<r 


fi 



(1 + m{m — 2r)) 


( 2 ) 


2 


= ^ (1 H- - 2r)) 


(Si 

where m = ^ 

(Si 

>Sy 

where 

(Sj/ 

4-2 


We are now going to illustrate the computation of r and for this purpose 
we shall use a table of Pearson’s which gives the correlation between the heights 
of fathers and daughters. 

The totals at the right and lower end of the table are first computed and 
the bracketed numbers are the sums of the numbers that precede. The 
means are 


1659.5 - 1179 480.5 

1376 ^1376 


1660.9 - 1390 260.6 

1376 " ^1376 


whose signs determine on which side of the working mean to “quarter” the 
table. This quartering is done in Table 1 by the lines vv and hh. Then the 
totals above the heavy horizontal separating line hk and those to the left of 
the vertical separating line vv are found, e.g. 2, 4.5, 7.25, • • • and . 6 , . 6 , 0, • • ■ . 
Multipl 3 ring these totals by the respective class marks, we find the outside lines : 
18, 36, 60.75, * ' • and 6.6, 6, 0, ■ ■ • . 

jSi is now = 1107.6 — 420.6 = 687, and an adjustment for the fact that a 
-y 

working mean has been used has yet to be made. This adjustment is xN-^ 
where N~u is the number of negative y's. — 728.) 

We have therefore for the adjusted values 

= 1107.6 - 420.6 4- ?^-728 = 825.07 
-y 1376 

= 1179 + 11^-728 = 1433.21 
~y lu7o 

Ti * .6767 Ti = .6170 

r == .546 

The standard deviations, according to the formulas (2) are 

Vr, = .031 ffrt ~ “927 
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The standard deviation of = Vi X rz has to be estimated by using the 
general formula for the standard deviation of the product c of two variables 
a and b ; 


s 


2 


a« + P + 


ab 


R being the correlation coefficient between a and b. Since -1 < R < -i- 1, 
substitution of these limits for J? leads to the inequalities 

<(? + ?)' 

putting a = n, b = Vi, c = / we have 


ri J ’2 r ri 7-2 


Considering the relation o-r = ^ 

It 

we have 2 r(vrir 2 “ < ff, < 2r(o-r,7’2 + o-r,ri) 

from which we derive with sufficient approximation 

(T, < -030 

A slightly different arrangement for computing r has been made in the 
following table. 

TABLE II 


Correlation between diameter of ike stem and length of the lonesi flower 'petal of 
, Trienlalis europaea* 



PS 

3 

16 

34 

45 

' 30 

0 

2 

0 

0 

0 

0 


PS 



-3 

-2 

-1 

0 

1 


8 

4 

6 

6 

Total 

1 

-4 

1 











1 

7 

^3 

1 

4 

1 

1 








7 

29 

-2 

1 

9 

10 

3 

1 








33 

-1 


2 

9 

22 

9 

2 

1 





46 

27 

0 



8 

19 

20 

4 

1 





62 

S 

1 

1 



7 

18 

.12 

6 

4 




48 

1 

2 




1 

8 

9 

3 

2 

1 



24 


3 






3 

6 

4 

1 



14 

1 

4 







2 

2 

1 

2 


7 


6 









1 

3 


4 


6 









1 


1 

2 

Total 

4 

15 

34 

53 

66 

30 

19 

12 

6 

6 

1 

234 


*E. Czuber: Die statistischen Forschungsmethoden, Wien, 1921. 
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table III 

X = Diameter of the slem. 

y = Length of the longest flower petal in millimeters. 
Working mean, - .825, Vm = 34.6. 


Class width 

of a: =. . 

4 mm. of 2/ »= 6 mm. 





Total 

P.S. 


Total 

p.a, 

X 

iitnoa x 

timos X 

1/ 

times y 

timea y 

~4 

16 

12 

-4 

4 

4 

-3 

45 

45 

~3 

21 

21 

-2 

68 

68 

“2 

60 

58 

-1 

63 

46 

-1 

46 

33 

0 

(182) 

(170) 

0 

(130) 

(116) 

1 

30 

6 

1 

48 

8 

2 

38 

4 

2 

48 

2 

3 

36 

0 

3 

42 


4 

20 

0 

4 

28 


6 

25 

0 

6 

20 


6 

6 

0 

6 

12 



(166) 

(10) 


~(198) 

(10) 

Mean 

-27 



+68 



The P.S. columns are the partial sums as explained in the previous table. 
The work of multiplying the totals by the class marks and of adding them has 
been separated here from the table. 

We obtain N « 234, =* 106, ^ 136 

<yj 

170 - 10 ~ X 13S 

fi “ .805 

130 + 2^ X 135 


116 ~ 10 + ^ X 106 


182 - X 106 
r = .82 

Pearaon^B coefficient for this table is r « .83. 

Finally we illustrate by a small non-grouped table where the partial eums 
can be written down immediately. 
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TABLE IV 


Correlation between Ages of Husband and Wife 


Age of 
Husband 

Age of 
Wife 

Deviation 

Huaband 

Deviation 

Wife 


22 

18 

-8 

-8 


24 

20 

-6 

-6 


26 

20 

-4 

-6 


26 

24 

-4 

-2 


27 

22 


-4 


27 

24 

-3 

-2 


28 

27 

-2 

+ 1 


28 

24 


-2 


29 

21 

-1 

-5 


30 

26 

0 

-1 


30 

29 

0 

+3 


30 

32 

0 



31 

27 

+1 

+1 


32 

27 

’ +2 

+ 1 


33 

30 

+3 

+4 


34 

27 




35 

30 

+6 

+4 


35 

31 

+6 

+5 


36 

30 

+6 

4-4 


37 

32 

+7 

+6 


Ave 30 

26 





Here O-deviations occur in the third, column. Hence^ 

= 26 + i X 8 = 30, Sx = 33, Sx = 31, Sy = 36, 

H-* +V +1/ 

n .86, Vi =s .91, r = .88 (Pearson'a r = .86) 

Appendix 

Proof of formula (1), page 1. The following notations will be used: 
(/(a?))® ^ probable value of f{x) 

(/(j/))J = probable value of f{y) for a fixed x. 

sgx - eign of a; = t—. for a; 0. sgx = 0 if a; g 0, 

l®i _1 


» ScQ page 7. 
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The assumption of liueoi' regression means that 
(4) 2/2 - 2/“ = - x^) 

We multiply both sides of (4) by some arbitrary function (}>{x) of x and get 

(yl - - x^)4>(x). 

Both sides are functions of x, We shall take their probable values for all x*a. 

Now, for a fixed x, yl<fiix) = (y<l)(x))l and the probable value of for 

all a:'8 is equal to the total probable value {y<i>{x)f. Bo we have 

iym" - - x^mx)y 

„ iiy - 


rv,x = 


({x — :c“)0(a;))" 


If now we take a:V the origin, we get 


Tv:x ^ 


and similarly 


Txiv — 


(yi^(a;))° 

{xti>{x)y 


ix$i{y)T 


" {yh{y)y 

where ipi is another arbitrary function. 

Replacing the probable values by the respective arithmetic means we get 

fel r - and r - 

with X, y as the origin. 

By a suitable choice of the still arbitrary functions if> and <^i , we may derive 
all the various expressions for regression coefficients. Taking, for instance, 
'•pix) - X, (f>i{y) = y, we get Pearson's expressions. Taking <f)(x) = 8g{x — ai), 
^i(^) - Mv - “*)> “2 being constants, we have 


/7\ - «i) 

~ Sx sg(x - at,) ' 
and if we make ci **= cti = 0 




Ty;x — 


_ Sj/sgx 
" Sxsgx' 


r*:u = 


Sx sgiy - ttj) 
Sy sg{y - a 2 ) 


Sxsgv 


Szsgx^ Sysgy 

Since Sx~Sy = 0, we can add Sy or S?; to the numerators and denominators. 
Adding Sy to the numerator, Sx to the denominator and multiplying both 
sides of the fraction by ^ we get 

_ '^Sy(sg{x ^ ai) H- l) 
ijSx(sg(a; - on) + 1) 


( 9 ) 
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Instead of (9) we can write 


( 10 ) 


S y-\~ y 
a: > ai x~ai 

S ^£1 re 

in > ai X = ai 


since the operations of (9) multiply the y ordinates by 0, 1 according as the 

ai’s are ^ ai . 

The expression (10), with a suitable choice of should be used for the purpose 
of numerical calculation of r. For instance, when calculating r from the data 
of Table IV, we took on = aa — 0 and had 


iSj/ + i S y 
-\-x a; = 0 

d-x 


When dealing with data which are arranged in a grouped table (Tables I 
and II) we take «i equal to the jj-ordinate of that classline which is nearest to 


the mean 


■( 


In Table I on = .5 *- With that choice of ai the sums 

S disappear and the sums S are equivalent to the corresponding suras 

X = 0(1 X > 0 !l 

S, Hence we have 


+x 




' Sy 

Sx 

(11) 

fti^d similarly 
ox 

T - +*' 

“ ‘ Sy 


"j-X 

+y 


Instead of (9) we can also wKte 


(9a) 


- ai) - 1) 

“ ^Sx{sg{x - ai) - 1) 


This leads to 


(lla) 


Sy Sx 

-X -y 


' It is desirable to chose the absolute values of the a’s small so that the maximum number 
of data enter into the oalculation of r. However, to take aj = aj ^ 0 would necessitote a 
division of the middle arrays of a grouped table, a laborious process. Hence the choice 
of the tt's as desciibed above. 
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Proof of the standard deviations of Formula (2). 

In my article on standard deviations and correlations of moments’ the stand- 
ard deviations of the expressions used in this article have been derived. 

In the following, the notation of the Metron article just referred to will be 
used. We use the symbols: 

P/m.r^ = 

P/m.ln = 

The summations indicated extend over all observations. The true or prob- 
able values of the same expressions are indicated by using p instead of P. 

I 

Pi/o 

= J’l * ^ 

“ 0/1 

We derive the standard deviations by defining the deviations as first variations. 

log ri = log Pi/u - log ?o/i 


Pi ~ Pl/o Po/1 

( 12 ) «•! = t(!r.)Y = (rl)* [ - '-^YT 

The probable values of the terms on the right hand side of the last equation are 
derived on pages 17-19 and listed on pages 32-33 of the Metron article referred 
to. The proofs which imply essentially a process of variation of Stieltje's 
integrals will not be given here. From pages 32-33 we take 


(13) 


so that 
(14) 


[(SP^)? - 5*^, KiP,;,)’)' = 

N ’ Lpi/o Po/i pi/oPo/iJ 


Assuming Gaussian distribution, we can put 


T s 

P*0 = ^P/IO 


Via 


^Vm 


Pii =» rVpoaPso = ’‘^P/iopo/i 


^ Felix BemBtein; ''Die mittloren. Fehler<)uadrate utid Korrelationen der Potenzmo- 
mente und ihre Anwendung auf Funktionen der Fotenzmometite/' Matron, Vol. X, N. 3 
Not. 1W2>. . 
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Hence 







Vm 


Pi/o/ 


Replacing the theoretical values by their corresponding empirical values, 
we have 


(16) crj, = ^ (1 -j- w** - 2m) where m ^ ^ 

^iV Sx sg y 

The formula for has been derived here for the value of ri as given by (8) 

i.e. ri = . In fact, we used ri = ^ in the examples in the 

Sysgy Sysg{v-a) ^ 

article, and a had some value absolutely smaller than ,5. To use equation (16) 

for the standard deviation of n is within the limits of the required degree of 

accuracy ; hence we shall disregard the difference. In a later paper the standard 

deviation of n for any a will be derived by using the method describe^n the 

Metron article, for a different purpose. 

To prove the statement in the footnote to page 7 

To find the value of Vz that makes 


Sf{x) {y - rixf a minimum. 


By differentiating we get 

SJ{x)iy '-nx)x = 0 
SxJ{x)x 

R f(^x) = 1 we get Pearson's coefficient, 
lijix) (a; 0) we get 

X 

r, 

a X SxBgx 
w i„r ® 


New York Univdrbitt, 

Departments of Anatomy of the Graduate School and the College of Dentistry. 



METHODS Oi: OBTAINING PROBABILITY DISTRIBUTIONS' 

By Burton H. Camp 


The emphasis of this paper will be on method* Special results will be cited 
in order to illustrate the methods rather than to summariae achievement in the 
field; for that has been done already by Rider (1930, 1935) Irwin (1935) and 
Shewhart (1933) in recent surveys. The pui'pose is to describe and to illustrate . 
most of the methods that have been used to determine exact probability dis- 
tributions, and to show that they are all derivable from one fundamental theorem. 
In order to prove this unity in a simple manner, it will be desirable to omit from 
consideration methods which are essentially ingenious forms of counting, such 
as are used in sampling without replacements from finite universes, and in 
finding* the sampling distribution of a percentile. 

The general problem to be discussed may be stated as follows: N individuals 
(hi ‘ ’ I ^JY) are drawn, one at a time with replacements, from a universe whose 
probability distribution is A certain single valued function of the t’s is 
formed. This is called a parameter of the sample, and is frequently also, 
but not necessarily, a useful estimate of the corresponiding parameter of the 
universe. The problem is to find its probability distribution, /(a;). As usual, 
a probability distribution is a function which is required to be defined, except 
perhaps at a set of measure zero, throughout the infinite domain of its variables; 
it is nowhere negative, and its integral over its domain is unity. 

Most of the more recent developments of the theory relate to a more general 
form of this problem. Instead of N individuals, there are sets of n individuals 
in each set, and these sets are drawn respectively from M(M S N) universes, 
each of which is described by a function of n independent variables, thus : 

(1) • ••,<.); (t = l,...,Af). 


Instead of a single parameter there are P parameters, and each is a single valued 
function of the observed values of the nN individuals in the sample, thus: 


( 2 ) Xi - g{(i\ 


( 1 ) 




j } 




4")!(i = 1, •■•.?) 


The first method to be described is fundamental and will be designated as 
Theorem 1. Let it be required that each g as described in (2) be not only 
single valued but also constant at most in a set of measure zero in wJV-way 
space of the i*&. Then 

(I) I /(*1, •■•,!,)«= I , (<f) dT 


* Preaentod to the Americun^ Mathsiriatical Sooiety at a meeting devoted to expository 
papers on the theory of atatistioB, April 11, 1936: 

go 
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where ^ is the space of aj’s and T the space of the i's, p is any measurable set 
of points in X, and q is the set in T for which g is in p. Often p is the P dimen- 
sional cube (iCi + A a:, a = 1, • • • P) at the point {xi, • ■ • , Xp) and then q is 
the set where 


(3) ^ Pi ^ + A®; (i = 1, • ■ ■ , P) 

and <!> is the simultaneous distribution of the sets of t’s, 


(i) 9 ifi }k )"* 9 (li , • ■ • , t), ). 

In this is the universe from which the set of <‘s is drawn. Obviously, 
if JV > M, some of the are identical, and then it is assumed that the several 
sets are drawn independently. Often, all Of the N sets of i's are drawn from 
the same universe. Then Af = 1 and all these 0's are identical, and (4) becomes 

* = [/■’((!"; [Ail"', 

In the special cose where there is but one parameter (P = 1) and but one 
individual in the sample {n = N = 1), and p is an interval, formula (I) becomes 

r*+A» ^ 

(la) '/ Six) dx= (pdt; 

and in the very special case where it is also true that g is an interval it becomes 


»(!)> 


(Ib) 


fix) = 0(i) ' 


dx ' 


provided also that certain derivatives (to be specified later in the proof) exist, 
where i is now the inverse solution of the equ^ition, 


( 5 ) 


a = p(0- 


The proof of formula (I) is immediate, if one is willing to assume the existence 
of the probability distribution f ; for then the left side is by definition the prob-' 
ability that the aj'a lie in p, and this is also the meaning of the right side of (I). 
(la) can be proved without assuming initially the existence of f(x), for then 
the existence of f(x) can be inferred from the existence of the right side of (la), 
because f(x) may be set equal (except perhaps at a set of measure zero) to the 
upper right hand derivative, with respect to Ax (Ax is a variable, and x is fixed), 




of f 0 d(j provided that one adds the condition that this derivative is nowhere 


infinite. The point at issue here is merely the existence of a primative for a 
monotone increasing function of Ax. (Ib) may be derived from (la) by taking 
the derivative of both sides with respect to Ax, if the derivatives are continuous. 

Theorem I, in these various forms is used a great deal, especially in the lost 
form (Ib). This affords one freedom to choose the most desirable function 
for purposes of tabulation. R. A, Fischer's z distribution, a logarithm, is an 
important illustration. Many authors have been interested in so choosing the 
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function that its distribution shall be normal, They include several of the 
older writers, and more recently H. L. Rietz (1921, 1927), and G, A, Baker 
(1932, 1934). However, the theorem is of special importance in the theory, 
for ail the other principal methods of obtaining probability distributions are 
essentially corollaries of it. These corollaries will be called Theorems II, III, 
and IV. 

Theorem II. Let p (the measure of p) and q (the measure of q) be infini- 
tesimals of the same order and let both the oscillation of f{i.e. maximum /- 
minimum /) in p and the oscillation of 0 in ^ be infinitesimals; then (I) may be 
written, 

(II) /p = 05, 

where / applies to any point of p and 0 to the corresponding point of q. This 

equation ( 11 ) is an approximate equation in the sense that diflferences of higher 
order than those retained are neglected. In particular, with the conditions 
used in formula (la), equation H becomes 


/Aa: = 05 . 


The left side of (II) is an approximation to the probability sought. The right 
side shows that, in order to evaluate it, one need only find the volume in T space 
of the differential element q and multiply it by the value of 0 in 5 . Formula (II) 
expresses the so-called geometrical method used by many authors, e.g., by 
R. A, Fisher (1916, 1926), by Wishart (1928), and by Hotelling (1926, 1927). 
Tlie chief difficulty in connection with it is in finding the volume of wA^-dimen- 
sional q. In order to display the advantages and disadvantages of this method 
wc shall pause at this point and look at a concrete example.^ 

Let two individual (h, ts) be drawn independently from a normal universe 
and consider the simultaneous distribution f(x, y) of the sum, x = k -{• k, 
and product, y = kk, the mean of the universe being chosen oa the origin. 
Here N — 2, n — I, M — I, and so. 


( 6 ) 




0 ^ 


2ir<r^ 


1 


“ 2 ^ 

e 


The point set q is the area lying between the two adjacent hyperbolae, 

kk = y, tik = y -h Ap, 
and also between the two adjacent lines, 

k k = X, k-\- k — X A®, 

where Ax and Ly are infinitesimals and are equal. This area may be computed 
by simple integration and is: 


* Seo gIbo G. C, Craig (1936). Craig uses anothor method to be explained later (formula 
Ilia). 
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^ _ 2A.'c Ay 

— iy 

= 0 


Hence II gives us immediately the desired result: 

J 2«r3 I 

fix, y) AxAy = — 5 c 




■s/x^ — 4y 


Ax Ay, 


= 0 if < 4y, 


if > 4iy, 

if < 4y. 

if > 4y, 


If = 4y, q is an infinitesimal of lower order than p = (Ax)^, and so Theorem II 
does not apply. In this case we must go back to Theorem I, and from that we 
can learn that the probability, 



/ dx dy, 


is an infinitesimal of the first order if p = Ax Ay = (Aa;)* is of the second order. 
Hence it cannot be approximately represented by a finite number times p. 
The oscillation of / in p is infinite. The form of the surface f(x, y) is interesting. 
The ordinates rise to infinity on the contour of the parabola = 4y, and \"anish 
within it, Tho surface is symmetrical with resj^ect to the plane a; = 0, but 
not with respect to the piano y — 0. However, it is clear that the total prob- 
ability of any given product, y (i.e. the probability of this y for all possible 
values of x), is the same as the total probability of hence 



and the corresponding formulae, 



and 


u 

2 r 

Jo 




1 


dx 


TTCr- JO — 4y 

must be equal; both may he reduced to the single form 



(y > 0), 


iy < 0)> 


if y 7 ^ 0 . 


This is the probability distribution of y. 

With this example before us, let us now reconsider the theory: 

(f) The requirement (in II) that the oscillation of </» be infinitesimal in q 
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will be satisfied if one can show that 0 may be expressed as a continuous function 
of the parameters {xi, x^, - • • , icp). In our example these parameters were 
X and If and 4> was so expressible (6). But if we had tried initially to find by 
means of (II) the distribution of the product y, independently of what values 
X might have; wo should have been stopped at this point, because <j) is not 
expressible in terms of y alone. We should also havo been stopped by the 
requirement that g he infinitesimal of order Ay, for q would have been the 
space between two hyperbolas and its area for any fixed (Ay > 0) would have 
been infinite, But, when thus stopped at that first point, it would have been 
clearly indicated to us that the distribution of y might have been found via 
the detour of finding the simultaneous distribution of both x and y, because 
an attempt to express in terms of y would have led to the given expression in 
terms of both x and y. For a similar reason R. A. Fisher (1925) was able to 
find the distribution of the vaxiajice by finding first the simultaneous distribution 
of the variance and the mean, Also, he was thus able to find the distribution 
of the coefficient of correlation by finding first the simultaneous distribution of 
alKthe first and second order moments. 

(fi) A distinct advantage of this method is that q is independent of the 
universe 0, so that once found it may be used in connection with any universe 
which satisfies the condition that it can be expressed as a continuous function 
of the parameters. Thus, the distribution of the sum and product in our 
example may equally well be found for the universe described by the Type III 
curve, > 0). For, then 

<^ = A" (i i, 0-'''+'’’ = A’ y e"", 


and so, using one-half of the same g as before, since now x,y ^ 0, 


/(x,y} = ye’"* 


= 0 


V'x* — 4y’ 


if 

if 


From this, F(y) can be found by integration (c./. Kullbach, 1934) 




= av f 


ax 


V4i V*® 4y 


dx 


.2 fto ' 

^ ^ j e 

2 Jo u 


du. 


X > 4y, 
X < 4j/. 


As another illustration, consider a normal universe of n iiitercorrelated vari- 
ables in which all the total intercorrelations are equal to r (e,g., the statures of 
n brothers) arid let the sample be a single group of n (one individual for each 
variable). 

* “ W”" R ® ’ 

where R » (1 - — (n — l)r], ki = (I - — (n — 2)r], and 

ki = — r(l “ r)'* . Suppose one wishes to find the simultaneous distribution 
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of the variance x and the mean y for such samples.® Since for Student's problem 
Fisher has found tlic value of q for this x and y to be 

q = cx ^ AxAy, 

their distribution f{x, y) for this universe may be written down immediately. 
In' terms of x and y the bracket in the exponent of 0 is i/(kin - hn + W) 
+ xn{ki ^ fca), and so /(a:, y) is the product of q and this form of 0: 

f{z,y) = ^ ^ i ^ ~ ~ ^ - n{ki ~ k2)x]. 

(iii) Another attribute of this method is that it sometimes lends itself to easy 
extensions from a simple ease where there is only one restriction (A^ - 1 degrees 
of freedom) to similar cases when there are more restrictions, Thus R. A. 
Fisher (1924) proceeded from the variance of a sample from a single universe 
to the variance from a set of universes, as required in the theory of analysis of 
variance; and thus also (1915) he had proceeded from the distribution of ?• to 
that of multiple R] and Hotelling (1927) showed how these distributions could 
be obtained when the values of each variate were themselves intercorr elated 
(as in a time series) and not merely correlated with values of the other variates. 

Theohem hi. Now let us consider again the fundamental form (I). For 
convenience let nN = m. If the conditions will not permit us to write the right 
side in the form in (II), it is still possible that we may be able to find that 
(m + l)-dimensional volume by some other method. In particular, whenever 
it is possible to iterate the integral once we have the formula: 

(HI) f fdX= [ dr I 

Jp Jt* JSbi 

where is the section of q by space at the point (fi, • * > , im-i) of. r space, 
r space being the space of the (h , • ' • , ti»-i) coordinates. With added condi- 
tions one may deduce from (III), for the case where there is but a single para- 
meter X, the approximate equation : 

(Ilia) fdx^dx [ dr • 0(ii , • • • , U 

J ^ 

in which is supposed to have been expressed in terms of the other coordinates 
by solving the equation x = g{ti /"■ ,tm)> It is an approximate equation in 
the same sense as (II) was. Sufficient conditions for this change in the left 
side of (III) have already been mentioned in discussing (II), The propriety 
of making the corresponding change in the right hand side may be left for 
determination when the form of <(> is given. It will perhaps be sufficient here 
to point out that our earlier example illustrates both the case where this change 


* A special case of a more general problem solved first by R, A. Fisher. 
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is permissible and where it is not. For, let it be required to find the distribution 
f{y) of the product y = Uk without reference to the sum, h + Formula 
(III) yields 


(7) 


'tf+AV 


/’(l/+Av)/fi 

f(y)dy ^ 2 dii dk 

Jo Ju!i\ 


1 

27rff^ 



e- 


This is valid for every value of y including = 0. If y 0, we may change 
the right hand side ns in (Ilia) and obtain as the probability that y is in the 
interval (y, i/ + A^) : 


( 8 ) 




dii + e, 


where e is a differential of higher order than Ay. This may be proved by com- 
puting the difference between the value of (7) when k has constantly the value 
{y + Aj/)/h and when it has constantly the value y/k. If y = 0 this change 
in the right side of (7) is not valid; it is easily seen that in this case the integral 
on the right of (8) is infinite. It may be shown, however, in this case that’ 


(») 



dy ^ ^ - 


1 rjL^!_^L 
Jl iC “s/ JC* — 1* 


and that this is an infinitesimal, and that it is of order as small as one. 

Many authors think of (Ilia) as the fundamental formula in the theory of 
probability distributions, One of the simplest and earliest applications of it 
was to establish the so-called reproductive property of the normal law: that 
the sum of two variates is distributed normally if each is distributed normally. 
Jackson. (1935) has used it to establish a similar property for two Type III 
distributions which have the same exponent of e. Usually this integral is 
difficult to evaluate when N > 2 because of the unsymfnetrical form into 
which it is cast, but \vhen N = 2 and there is but one parameter (Ilia) it is 
perhaps the most convenient of all the formulae. 

Theorsm IV. An exceedingly useful formula is obtainable from (I) in the 
following manner. Let'fl(xi, ••• , 3;^; , kq) be a finite single valued 

function of the old parameters (as) and of some new parameters (a). Subject 
to genera] conditions to be stated we may wite: 

(IV) ” Jr 

an identity with respect to each «, where is the result of substituting (2) 
for the a:'s in B. 

Since this theorem has not been proved in this general form, an outline of 
the proof will be given. Sufficient conditions ate: 

(a) All the integrals involved shall exist. 
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(6) If p is limited (in the sense that it lies within a finite hypersphere), so 
is q, and conversely. 

Proof. Let Xo be a limited p set and To the corresponding q set such that 
both (c) and (d) hold (e > 0): 


(c) 

(d) 




< h 

< €. 


It is easy to see that such an and a corresponding To do exist, as/ollows: 
Let Xfl be a limited set for which (c) is true, and for which it will remain 
true no matter what points are added to Xo. Similarly, let To be a limited 
set for which (d) is true and for which it will remain true, no matter what 
points are added to tI. Presumably Xo and To do not correspond to each 
other, but we may now let Xa be the totality of all the points of Xi and of all 
tliose points of X corresponding to To, and let To be the totality of all the 
points of T! and of all those points of T corresponding to X! . Then Xo and 
To do correspond to each other and have the desired properties (c) and (d). 
Now, since 6 is finite, it is limited in Xo. Let 

(e) \e\<HmXo. 

Divide the interval (—H, H) into s equal subintervals of length h, thus defining 
in Xo according to Lebesgue the measurable sets, 
pv (t = 1, • • * , s), and corresponding j,- sets in To : 

Os 0 ^ hm Pi, 

(/) 

^ |Os ghing^. 

Choose arbitrarily any point of and let be the corresponding value of d. 
Then let 

e = ki'm Pi {i = \, “•, s), and sbnilarly let 
6* = k{ in gff (i = 1} ‘ , «). 


Then 
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Now 




{d~e)fdx 


g f \d-e\fdX^h [ fdX, 

JX(, Jxo 


and 


f (6' - 6') dX gk f d>dX 

jTa Jto 


So, as h approaches zero both sides of (p) approach limits and their limits are 
equal : i 

I 0fdX^ I d‘4>dT. 

Jxo jTo 

Hence by (c) and (d) the integrals 

j e'tf>dT, 

differ at most by and so, being independent of t they do not differ at all. 
In order to determine the form of / from (IV) one must first evaluate the 
right side, 

j^e(i>dt = , Dfa); 

and then solve the Integral equation, 

( 10 ) j^efdX = ^, 


It is the solution of this equation that usually presents the most difficulty. 
Particular forms of 9 that are being used are 

( 11 ) ^ 

in which case ^ is said to be the ''characteristic function^’ or "moment generating 
function"; and 


(12) = xf ' > ■ • x“p^, 

in which case ^ is a "moment function" or "moment'' of /. Other forms might 
be used, For example, a very convenient method of demonstrating the correct- 
ness of the usual formula for the simultaneous distribution of the correlation 
(x), means {y, z), and variances (li, i^), in samples from a normal bivariate 
universe is by the use of 

This method of finding / is not a final determination, of the probability function 
desired until it has been shown that the solution is unique, a serious problem 
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in itself; it is one of those which Professor Shohat may consider/ There are 
thi'ce methods of solving the integral equation (10) : 

(i) The first might be called guessing. Though unscientific, it is in fact 
often effective. Especially is it available if the distribution has already been 
surmised but not demonstrated. Thus, it was open to Student (1908) when 
he correctly surmised the distribution of the variance. Similarly it was open 
to Soper (1913) when he incorrectly surmised the distribution of r, 

(ii) Papers by Romanovsky (1925) and Wilks (1932) have shown how the 
problem of solving the integral equation may be shifted to the problem of 
solving a partial differential equation, but this in turn may involve the solution 
of another equally difficult integral equation in the process or determining the 
arbitrary function, 

(m) If each a be replaced by an imaginary jii and one uses a Fourier trans- 
form, one arrives at a set of formulae which are most important. For the case 
' where there is but one x and one they may be written; 

(18) [ e‘^‘f(x)dx ~ [ e‘^i,dT = 

J—^ Jr 

(14) fix) = 403) dff. 

Dodd (1926) has given an equivalent set of formulae involving only real vari- 
ables. It is easy to prove that both sets may be changed to the single formula, 

(15) /(x) = - I I cos0(x - g) d^. 

TT Jt Jq 

Kullbach (1936) has established the validity of the formulae corresponding to 
(13) and (14) for the general case of (P + Q) parameters. Wishart and Bartlett 
(1933) used the general forms to find the distribution of the generalized product 
moment in samples from an n-dimensional normal system. 

When the solution of the integral equations of (IV) cannot be found, one 
has to put up with the semi-invariants or with the moments of /, Formulae 
(IV) and (11) yield the semi-invariants, (IV) and (12) the moments about the 
given origin, and from either of these one may obtain the moments about the 
mean point. These methods are old but they are still important. Time does 
not permit me to discuss them, because it would not be proper to close this 
paper without some reference to limit methods. 

Limit Methods. It is well known that the distribution of means of samples 
taken from almost® any universe approaches the normal law as a limit as iV 
becomes infinite. This theorem is subject to great generalizations, as is indi- 
cated in paper, s of A. Liapounoff (1901), S. Bernstein (1926), Romanovsky 

* In a later paper at the same eympOBium. 

‘ There are exceptions. E. ff., meana of samples taken from the universe a/ir(a + i*) 
liave a distribution identical with the universe itself. 
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(1929, 1930) and C. C. Craig (1932). Subject to very general conditions it 
has been shown that; If the characteristic function of one probability distrb 
bution contains a parameter and approaches as a limit, uniformly in every 
finite domain of its variables, the characteristic function, of another probability 
distribution; then the first distribution approaches as a limit the second distri- 
bution. Hence S. Bernstein and Romano vsky have shown that: If the universe 
is an n-way correlation solid of a certain very general type, then the n means 

obtained by a selection of a sample of N sets of variates, ^ (b’l “!“’■■+ i<Ar), 

(i — 1, ■ ■ • , ft), have a distribution which approaches as a limit a normal 
correlation solid as N becomes infinite, A similar theorem has been established 
also in the interesting case of Romanovsky's “belonging coefficients”, which 
include K. Pearson’s coefficient of racial likeness. Also, by the method of 
maximum likelihood. Hotelling (1930) has proved that under certain general 
conditions all optimum estimates of the parameters of a frequency distribution 
have a joint distribution approaching the normal aa N becomes infinite. The 
validity of the method of maximum likelihood when used for this purpose has 
been established by J. L. Doob (1934). 

Finally, one may note an apparently new limit theorem of another type. 
Its general nature will be obvious from the following application : 

Let a sample of N be drawn from the universe, 

if i > 0, 

^ =0 if i ^ 0. 


It is readily proved, by means of (IV), that the distribution /(a;) of the para- 
meter, 

X = {h + ■ ' • + ) 

is a curve of the form, 

/ (a:) = where x > 0, 


= 0 elsewhere. 


Now let X become infinite. The universe approaches os a limit the rectangle: 

^ A where 0 ^ i < 1, 

= 0 elsewhere. 

The parameter x approaches as a limit X, where X = maximum if. The 
distribution /(x) approaches os a limit the new distribution, 

F{X) ^ NX^-^ where 0 < | X | < 1, 

= 0 


elsewhere. 
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Hence wfi have proved in a new way, what was already known: that tlie distri- 
bution of the greatest variate obtained by sampling from a rectangular universe 
is of the form 

The limit theorem implicit in this illustration can be established in sufficient 
generality, but I do not yet know whether it has other applications of value. 
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MOMENT RECURRENCE RELATIONS FOR BINOMIAL, POISSON 
AND HYPERGEOMETRIC FREQUENCY DISTRIBUTIONS* 

By John Rioudan 

1. Introduction. This paper gives the development of recurrence relations 
for momenta about the origin and mean of binomial, Poisson, and hyper- 
geometric frequency distributions from the basis of the moment arrays defined 
by H, E. Soper.** This procedure has the advantage of expressing the moments 
in terms of coefficienta Tvhicli are alike (or the three distributions and are de- 
rivable by a single process, thus providing a degree of formal coordination of 
the distributions. For both kinds of moments, the coefficients satisfy relatively 
simple recurrence relations, the use of which leads to recurrence relations for 
the moments, thus unifying the derivation of these relations for the three 
distributions, The relations derived in this way for the hypergcometric dis- 
tribution are apparently new. Apparently new recurrence relations for certain 
auxiliary coefficients in the expression of the moments about the moan of 
binomial and Poisson distributions are also given. 

This course of development involves repetition of a number of well-known 
results which is justified, it is hoped, by the unification obtained.** 


‘ Presented to the American Mathematical Society, Sept. 3, 1936. 

* Frequency Arrays, Cambridge, 1922, 
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2. Moment Arrays. As developed by Soper, frequency distributions may be 
exhibited by frequency arrays, in the ease of a single variate, in the form : 

(2.1) /(A) = E P* A* 

X 


where p* are the frequencies with which the measures, x, of the character, A, 
occur in a population. 

The substitution A - e“ leads to the moment about the origin array: 


(2.2) 


/■(«“) = 


A 



where 


Wj = 


p* X 


The symbol a is a logical or umbral symbol serving merely to identify the 
moments in the expansion of the array. 

The moment array for moments about the mean is found from the relation : 


0(0 = ^-"* 7 ( 6 ") 


where mi is the first moment about the origin. 

The moment arrays for the distributions concerned are os follows : 

Binoftnial /(e") - [1 + p(e" - 1)1" = E P*(e" - 1)' 


Poisson 


/(e") = = E 


o’(e" - 1)’ 
X I 


Hy'psTgeomeiric /(e“) = E ■ 

3-0 (tt)* X \ 

\ 

where the parameters p, n, and a for the binomial and Poisson have the usual 
significance. The parameters for the hypergeometric distribution, with the 
substitution r = s, follow Soper; Pearson (loo. cit.) uses g, r, n, where q — l/n. 
The notation (!), means 


(i)i — i(i — 1) ' ■ * (i — a; -|“ l)t 


It will be seen that, with the usual interpretation of 



as zero for a: > n, 
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tho three distributions so far as concerns ce may be exhibited by a function 
of the form 

/(«“) = £.1.(6“ - 1)‘ 

■whore ill of course depends on the distribution concerned. 


3, Moments About the Origin. The moments about the origin can then bo 
defined by the equation: 

(3.1) Em,2!= D* 

a"=0 S I 

and 

E A’(e’ - D* = E ^. £ ( - 1)— f*! e” 

s={l i-O v"0 V/ 

= £ 

«=iQ S 1 ic‘^0 

where Sx, s is a Stirling number of the second kind, as used by Jordan (loc. cit.) 
and defined by 

*13.,.= E(- = 

A'O* being in the language of the finite difference calculus, a "difference of 
nothing” that is j •«- = O'. 

The internal series terminates at a because Ss.a = 0, a; > s, as is readily 
apparent in the finite difference expression. Further ~ 0, s 5^ 0; jSo.o “ !• 
By equating coefficients in equation (3.1), m,, the sth moment about the 
origin, is given by 

a 

(3.2) w, = X) 

a=0 


The particular forms for the three distributions are as follows: 


(3.3) 

lUa — (71)2; P a 

Btnomial , 

(3.4) 

m* = 2 a* Sfi, B 

Poisson 




(3.6) 

^ V <? 

Wa — / ^ y. ,L Oxx s 

ic-D (71)3: 

ffyperffeometric 


The Stirling numbers have the following recurrence relation (Jordan loc, 
cit.) : 

(3.6) 
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This relation in conjunction with equations (3.3)-(3;5) leads to moment recur- 
rence relations. The procedure is illustrated for the binomial distribution as 
follows : 

«+i 

Vla^i = ^ (■a)* p Sx, 

ar—0 

fl+l 

= ^ (w)* P* {x Sx, a "b a) 

= p Dp tn» -f- {npmt — p* Dp m,) 

- npm, + VQ Dp m, 

where q - I - p. 

The steps in the process arc expanded as follows: 

a+1 tf 

£ (n)xP* xSf,9 ^ 52 (»l)* p* X Sx, a 

x-0 *-0 

» 52 (^)» Ss, i pDpip^,) 

x=>0 

=: pDpin, 

«-|-l 0+1 

2 j (tI !C + 1) Sx-lt « 

x-O i"0 

- n 52 , - £ a:(tt)*p*'^^(S«, i 

' ^ npma- p^DpW, 

The results for the three distributions are as follows; 


(3.7) 

Mt+i = ^pnit + pqDpVit 

Binomial 

(3,8) 

ma+i == am, -b aD<,m, 

Poisson 

(3.9) 

Iv 

- - mail - 1, r " 1, - 1) - (n -b l)Anm, 
n 

Hypergeomeiric 


Here Dp and Dt, denote differentiation with respect to p and a, respectively, 
and An denotes the difference operation with respect to ti. For the hyper- 
geometric distribution the moments are functions of I, r, and n as well as of 5 ; 
mi{l - 1, r - 1, n - 1) is the same function of i — 1, r - 1 and w — 1 as 
Mi{l, r, n) is of I, r, n. Equation (3.9) appears to be new, 
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For convenience of reference, a short table of the Stirling numbers of the 
second kind follows : 


0 

1 

2 

3 

4 

5 


0 

1 

0 

0 

0 

0 

0 


1 

1 

1 

1 

1 


1 

3 

7 

15 


1 

6 

25 


1 

10 


1 


4. Moments About the Mean. As shown in Section 2 above, moments 
about the mean may be defined as follows: 


(4.1) 


E 1)‘ 

6^0 S I 


z=0 


where mi is the first moment about the origin; 

mi = np Binomial 
= a Poisson 
= Ir/n Hypergeomeiric 

Now 




1=0 


(e*-ir=E^E(-ir"(i') 

2?aD ' 13^0 \v / 


(h-mi)a 


= 2 ^ S I A, (T,. a 

flF=0 S 1 fl;“0 


where 


(Tx.i = (-' 1) 

*=0 




(:) (. - m,Y = 




It will be observed that for mi - 0, <r*.a = ^S*.a . The internal aeries terminates' 
at s for the same reason as before. 

The moments about the mean are then given by : 


(4.2) 




g 

= 2 35 1 Ax tr*. 1 




The particular forms for the three distributions are as follows: 


(4,3) 

= £ (w)* pi » 

Binomial 

(4.4) 

a 

Pt -^0^ tf*. a 

Poisson 

(4.6) 

’‘•-h («)- ’■* 

Hypergeomeiric. 
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The coefficients giitisfy the following recurrence relation:* 

(4.6) (j'x.a+l ^ (sj — Wil)o'»j8 "h 0'x—i,a 

which in conjunction with equations (4,3)-(4.6) leads to moment recurrence 
relations as before. The actual derivation is somewhat complicated by the 
circumstance that ax,» is a function of mi and therefore of the frequency param- 
eters, rather than a constant os before. The derivation is illustrated for the 
binomial distribution as follows : 




Jtij+i 


~ X/ (^)x V ^3S, *+l 


1^0 

g+1 




a 




- £ (n)x * V^p(P*) ~ + £ (^)* P* , I 






= pDp/ij + nsptia .,1 - npju, + npn, - p\DpPa + ns^8_i] 
= VQ [wsM,_i + Dp/iJ. 


The steps in the process are expanded as follows: 

B a 

£ (n)*(r*,.pDp(p^) = in)Ap^p{p'‘cx.t) - / pi>p((r* . J] 

B 

- pDpp, - p £ (n)®p*(- tistr,,,-!) 

^ P^pf^s "1“ USpiX^^i 
^4-1 d+l 

£ (n)^p*cri^i., = £ (rt - a; 4- 1) 

= ?^ £ {n)x p^'^^ x{n)x p*^'^ ffx . • 

i“*0 

= rtpps - 4- 

The relation Dp(Tx,t, = is obtained from the definition equation of 

(with mi - ttp). 


The resulting recurrence relations for the three distributions are as follows: 

(4.7) >i.+i — nsp 7 (i,_i + pq Df p, Birwmal 

(4.8) /Xi+i = -{- a Vo pi Poisson 


* Jordan, loo. cit. or E. C. Molina, An EafpaTmon for Laplacian InlegraU . . . , Bell 
System Technical Journal, 11, p. 671, 
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( 4 . 9 ) 


where 


M = (n + 1) 



S C) ^ 

S ^ 1, r -- 1, — 1) j 


Hy'p&rge&metric 


Ki 

K, 


— It _ a ^ 

?i(tt + 1) “ " n 

{I - 1) if - 1) 

{n - 1) 


It 

n 


The last of these, which appears to be new, seems to be of formal interest only. 
The coefficients 0 -*,. are related to the Stirling numbers by the expression: 


^*.4 ^ ( 1) 

v=D 



A—Z 

I B — -11 


and consequently can be exhibited with detached coefficients in the form 
flo + + fla + * • ' + I’or the binomial and Poisson distributions 

certain simplifications, to be developed in the section following, in equations 
(4.3) and (4.4) may be made. For the hypergeometric distribution it appears 
necessary to use equation (4.6); the following short table of tra-.,, employing the 


detached coefficients mentioned above, is given for this purpose : 

\ flS . ^**1 

i\ 

0 

1 

2 

8 4 8 

1 

0-1 

1 



2 

0+0+1 

1-2 

1 


3 

0+0+0- 1 

1-3+3 

3-3 

1 

4 

O+O+O+O+l 

1-4+6-4 

7-12+6 

0-4 1 

5 

O+O+O+O+O-l 

1-6+10-10+6 

16-36+30-10 

26-30+10 10-6 1 


5. Binomial and Poisson Moments About the Mean — Simplified Formulas. 
5,1 Binomial. From examination of the first few moments about the mean, 
it appears expedient® to write the formulas: 

nz,=^T,ax.<u{nnT 
(5.1.1) . 

HZi+l = (e - p) S «* , 2a+l (.TLpqY 

T— 1 


* The kind of expression chosen admits of some variety. A re(^ur^ollce relation for 

B 

coefficients in the expansion nt => ^ oir.tP* been given by E. H. Larguier, On a Method 

For Evaluating the Moments of a Bernoulli DisltibxUion, Bull. Am, Math, Soo,, 42, 1, p. 24 
(Abstract 8) r I am indebted to Mr, Larguior for the opportunity of examining his results 

1 I ' 

in advance of publication, 
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When these are substituted into the moment recurrence relation, the coefficients 
are found to be related aa follows : 

+ (2s ^ l)ilfx-l,2*-2 
-2pQ[l -b 2x 4- 2'pqDpg]ax,i,^i 




or, in general, 


a^.a+l = [X + + SO£j-l,*-l 

- Pff[l “ (-1)*] [1 4- 2x + 2pqDj>g]ax,s 
Using detached coefficients of powers of pq as outlined above, these coeffi- 


cients 

\® 

may be exhibited aa follows: 




A 

' 1 


2 

3 

4 

2 

1 





3 

1 





4 

1-6 . 

3 

1 



5 

1 - 12 

10 




6 

1 - 30 + 120 

25 - 

- 130 

15 


7 

1 - 60 + 360 

56 - 

- 462 

105 


8 

1 - 126 + 1680 - 

5040 119 

- 2156 + 7308 

490 - 2380 

105 

9 

1 - 252 + 5040 - 

20160 246 

6948 + 32112 

1918 - 13216 

1260 


It may be noted that the coefficients of the first column in conjunction with 
equations (5.1.1) give the binomial seminvariants. 

Equations (6.1.1) make the coefficients functions of pq only; a slight alter- 
afcion makes the coefficients functions of n only, Thus : 

E 

(5.1.3) 

/i2.+i = (g - p) E /3*.2 .+i {pqT 

x-l 

and the coefficients are found to satisfy the recurrence relation: 

(6.1.4) ^ "b ^ [1 ( 1) ](2x — l)^r^i,i. 

These coefficients may be exhibited by a rearrangement of the table given 
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above as may be seen by comparing equations (5.1.1) and (5,1.3). The first 
few coefficients arc ns follows: 



1\ 

1 2 3 

2 

1 

{ 

3 

) 

1 

4 

1 -6 + 3 

5 

1 -12 + 10 

fi 

1 - 30 + 25 120 - 130 + 15 

1 

5.2 Poisson. 

The Poisson moments about the mean may be expressed as 

follows: 

1 

fi/21 

(5,2,1) 

1^1= 0!j,aO! 

*“0 


where [ ] represents “integral part of' and 


(5i2i2) ~ ^^x,a d* 

The coefficients oii,, are the constant terms in the expressions for the corre- 
sponding binomial distribution coefficients in powers of pf. 


Bell Telephone LAnonAToniBs. 



NOTE ON ZOCH’S PAPER ON THE POSTULATE OF THE 

ARITHMETIC MEAN 

By Albert Wertheimer 

1. Introduction. There appeared recently a paper by Richmond T. Zoch^ 
entitled "On The Postulate of the Arithmetic Mean." The stated purpose of 
hia paper, was to show that the derivation of the Postulate as given by Whit- 
taker & Robinson, is not correct. It is the purpose of this paper to show, 
that Zoch has not proven any error to exist in the Whittaker & Robinson deri- 
vation, but that there are a few errors in hia paper. As this paper is intended 
to be read with Zoch'a paper as a reference, the terms used there will not be 
redefined' here, and except where otherwise stated, the symbols used will have 
the same meaning. 

2. Zoch introduces the function 

/ s 5 + afia/ju3 

and claims that it satisfies all the four axioms of Whittaker & Robinson, and 
obviously it is not the arithmetic mean. He therefore concludes that their 
derivation must have errors somewhere, and proceeds to find them, Let us 
first examine the / function. Considering only the part /la/na , the partial 
derivatives with respect to n;,- are given by 

3JU2{(a;^ -- — jtial 2p3(a;i — S) 

2 

nui 

It is then stated (p. 172) "... clearly these partial derivatives are single valued 
and continuous. Therefore the function /i 3 /fi 2 satisfies axiom IV," Now, 
the condition that a function be continuous and single valued means of course 
that this be true throughout the region of definition of the function. It is not 
shown how these derivatives are clearly continuous and single valued for the 
very important case where all the a;'s are equal and the derivatives become 
indeterminate. As a matter of fact they are not continuous in this case, and 
therefore the / function does not satisfy axiom lY. To prove this, we only 
have to consider the very simple case where we let 

iCi' = fc + CiZ 


iThia Journal Vol. VI no. 4, Dec. 1935, pp. 171-182. 
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where k is & fixed constant, Ci is a set of- arbitrary constants not all equal, and 
2 is a parameter. We then have 

= k cz 

f j 

, Ma = mz 

t s 

Ms = MiZ 

where 

c = 1/n S Cf 
Mi = ^/n 2 (Ct- “ 

. Ms = 1/n (ci - c)® 

Substituting these values in / and the derivatives, we get taking o = 1, 

/ = fc + 25 -f 

g//a^, = i/« 4- 3»*w(»*(c. - if - »Vij - - e) 

nz^Mi 

Now going to the limit when z approaches zero, and all the sc's approach k, 
we get 

limit / ^ A, 

limit a//8i( = 1/»| -2 + 3(c, - - 2 4{e, - c)/w’) 


Thus, when all the jc’s approach the same value, the function / also approaches 
the same value independent of the c's, that is regardless of the mode of approach, 
while the derivatives can take on any vaiue depending on the c's that is on 
how the limiting value of / is approached. The / function then does not have 
continuous single valued partial derivatives, and therefore does not satisfy 
axiom IV. 

In part 2 of the paper it is stated ^‘Now when the Xi all approach a then both 
/ and df/dXf become indeterminate forms. However, in this case/ takes an 
indeterminate form which can be evaluated and it can be shown that Ms/Mi 
will always have the value zero, i.e,,/ will have the value a when all the Xf a] 
while the df/dXi can take any value whatever and in general the df/dXi will 
not be equal when the Xi a" This statement really amounts to saying that 
the / function does not satisfy axiom IV, but it is there used to demonstrate 
that one of Scliiaparelli^B propositions is false. 

3. Having exhibited a function different from the arithmetic mean, and sup- 
posedly satisfying all the four axioms, the question is asked "Where is the proof 
given by Whittaker & Robinson lacking in rigor?" After numbering the 
various steps in the derivation "... for the sake of rigor and careful reasoning 
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. . it is stated (p, 174), ^‘The sixth* step involves the tacit assumption that 
the partial derivatives arc functions of k. These partial derivatives are not 
necessarily functions of k . . and it is therefore concluded that the sixth 
step is not valid. Now, how can any function that by definition is to be evalu- 
ated at Okxi not be a function of 7c? What is shown (pp. 174-5) is that 
these derivatives do not necessarily involve k explicitly, but this is neither 
implied nor necessary for the sixth step, and there is no ground for doubting 
its validity. 

4. In order to overcome the supposed defect in the sixth step, it is proposed 

to change axiom IV so as to require the partial derivatives to be constants. 
But even then (p. 175) . . there remains an objection in the seventh step,” 

Now, the seventh step consists of the statement that if 

4>{xi) = ^CiXi 

where the c’s are independent of the jc's then due to the condition that 0 be a 
symmetric function, all the c^s must be equal. To show the defect in this 
step it is stated, that under certain conditions "... the function / ^ a; -f 
will have partial derivatives with respect to Xi which are unequal and constant; 
yet at the same time the function / is a symmetrical expression of the n vari- 
ables.” Granting that all that is correct, what has this got to do with the 
seventh step? The / function certainly is not of the type 2 CiX,- to which 
the seventh step is applied. 

5. One more point should be mentioned. On p. 181 it is supposedly proven 
that any function satisfying the first three axioms must have continuous first 
partial derivatives. The proof is essentially as follows: Assuming all the a's 
arc given the same increment Ax, the increment of the function then is A0, 
It is then stated “. . . but by axiom I, = Aq;. Therefore A(7»/Aa; = 1 = dip/dx. 
In other words, the total derivative of ^ exists and is constant, Therefore the 
total derivative of ia continuous.” From this, the continuity of the first 
partial derivatives is proven by means of Euler’s Theorem for homogeneous 
functions. Now, just what does the symbol d(f>/d3C (which is called the total 
derivative) mean for a function of many independent variables? Besides, 
(whatever this symbol means) is, it considered rigorous to deduce a general 
Theorem from the very special cose where all the differentials are made equal? 
This is one place where the / function could be used effectively as an exhibit 
of a function satisfying the first three axioms, and not having continuous partial 
derivatives. 

It is also stated (p. 181) that . it would seem more satisfactory to postu- 
late that the function <t> is single valued, for the single-valuedness of a derivative 
docs not insure the single-valuedness of the integral while the single-valuedness 
of a function docs insure the single-valuedness of the derivative lyb^re the 
deri\'^ative exists.” This statement is certainly not self evident and requires 
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proof. For a single variable at least, it is easy to imagine a function repre- 
sented by a curve with corners defined in a certain interval. The function then 
could be single valued everywhere in the interval, while the derivatives at the 
corners may exist and have two distinct values, depending on whether the 
corner is approached from the right or the left. On the other hand it is hard 
to imagine a curve representing a single valued function such that the integral 
i.e. the function represented by the area under the curve should not be single 
valued. 

6. In Conclusion: It is stated in the Introduction that “Since this book has 
had wide circulation, it is believed that the errors in this proof should be called 
to the attention of the users of the book. The present paper has been prepared 
for this purpose." It is for the same reason, that this paper was prepared to 
sliow that no error has been proven to exist. 


Bureau of Ordnance, U. S. Navy Department 



NOTE ON THE BINOMIAL DISTRIBUTION 
By G. E, Clark 


The purpose of this note is to show that 



m = (-1)” 


(?"nl 

TT 


/ pV sin Tg 

\g/ 


where is an integer ^ 0, 0 < p < 1 , p + 5 = 1 , and = x{x — 1 ) (a: ” 2 ) 
>"(x - n), is sl function whose values at x - 0, 1, 2, ■ • ■ n are the successive 
terms of the expansion of (g + p)", and also to consider the problem of fitting 
j{x) to an observed frequency distribution. 

The statement made about ( 1 ) can be verified by evaluating ( 1 ) as an inde- 
terminate form. On the other hand, ( 1 ) can be derived by observing that the 
x-th term (x an integer) of the expansion of (g + p)'‘ is 


( 2 ) 


n\ r(n + l)p*g’‘-* . 

xi(?i - x)r ^ r(x + i)r(n -x + iy 


then ( 1 ) can be derived from ( 2 ) by means of the product expansions for r(2) 
and sin x, This derivation of (1) from (2) can also be carried out by expressing 
(2) as a Beta function and then using 


B(x + 1, n - X + 1) 




(1 + 0 "+® 


dt - (-1)^ 


,(n+l) 


(?i-f 1)! sin ttx' 


This integration can be performed by means of the theory of residues. 

Consider the problem of fitting (1) to an observed frequency distribution. 
We shall write (1) in the form 


( 3 ) = + M* - 2 ) 

and determine the constants a, b, n, and h so that, when g is the mean of the 

observed distribution, F(z) will fit the distribution, 

The values of a, b, n, and h can be determined by the method of moments. 

Let J'2 , 1/3 , and j/4 , denote the usual second, third, and fourth moments of the 

distribution, which are calculated in the usual way (as in W, P. Elderton, 

Freguenci/^Curves and Correlation) and not adjusted by. any procedure such as 

2 

Sheppard's adjustments. Also, use the usual notation jSi = ^ and 

>*2 Vi 
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Then, the method of moments gives 


(4) 

( 6 ) 


= 3 

3 -f- 

2 + ni3i ± '\/n/9i(4 -f n^i) 
2 




a = (-!)’ 


/i(S/)n! 


, where S/ is the sum of the frequencies of the distribution. 


ir(l 4* 6)'^^ 

An integer n is chosen nearest the value assigned by (4). The two values of 
h from (6) determine two curves that are congruent but whose skewnesses are 
of opposite sign. Hence, h is uniquely determined by (5) and the sign of the 
skewness of the data, 

Por a symmetrical distribution, & = 1, rj — 0, and 

2 


n ss 




3-^2 

Vn 

2 "%/ P2 


We shall consider an illustrative example. In the following table the columns 
f{z) and fi{z) are taken from W, P. Elderton, Frequency'Curves and Correlation 
(1006), page 62. f(t) is an empirical frequency distribution, while fi{z) is 
obtained by fitting a Pearson Type II curve to the distribution f(z). fi(z) is 
computed from 


/,(«) = 1624 2 ^, X = 2.0973 + -SOSz 


which is determined by the method of this note. /a(2) is obtained by fitting 
the normal curve 

2 ( 1 . 820 ) 


Mz) = 486.1e 


z 

/(^) 

fiU) 

Mz) 

Mz) 

^3 

11 

18 

14 

19 

-2 

116 

107 

109 

92 

-1 

274 

281 

286 

263 

0 

451 

438 

433 

444 

1 

432 

437 

433 

444 

2 

207 

207 

285 

263 

3 

116 

106 

109 

92 

4 

16 

18 

14 

19 


The coefficients of goodness of fit for fi(z), f 2 (z), and faiz) 'are respectively 
.35, .58, and .02. 



CONVEXITY PROPERTIES OF GENERALIZED MEAN VALUE 

FUNCTIONS' 

By Nilan Noeris 


Consider the following generalized mean value functions: (1) the unit weight 


01 ' simple sample form, 0(f) = 


_ ( + 3:2 + ■ ■ ♦ H~ ^ n \] • 


n 


, in which the Xi are posi- 


tive real jninibors not all equal each to each, and in which t may take any real 
value; (2) the weighted sample form, <o(f) = + +_Cn^V 

\ Cl + Ca T “ ' T Cn / 

in \vhich the C{ are positive numbers not all equal each to each, and in which the 


Xi and t are restricted as in 0(0; (3) the integral form, 6{i) — 


x‘dx 


where 1 a;‘da: exists for every real value of t; and (4) the generalized integral 


f 


form = j where 0(a:) is a non-decreasing function integrable 

ill the Riemann-Stieltjes sense such that 0(«5) - 0(0) = 1, and such that 
x^dTp{x) exists for every real value of t. The facts that all of these func- 


r 


tions are monotonio increasing and that both 0 (f) and w(f) have two horizontal 
asymptotes have been previously demonstrated,* Although the existence of 
0 (f) and «(f) has been known since 1840, there appears to have been no attempt 
made to investigate the behavior of the second derivatives of them.® 

When the s,- are price relatives, production relatives, or similar data, 0 (f) 
andw(f) yield common types of index numbers by direct substitution of integral 
values of t. For any values of t such that 0 < fi < fa < « ^ the type bias of 
0 (f 2 ) will be greater than the type bias of 0 (fi), Similarly, for any values of t 
such that ^ « < fi < fg < 0 , the type bias of 0 (fi) will be greater than tlie 
type bias of 0 (/ 2 ). The second derivatives of 0 (f) and w(f) indicate whether 


' Presonted at a joint meeting of the American Mathematical Society, the Econometric 
Society, and tlio Institute of Mathematical Statiatica at St. Louis on January 2, 193G. 
The writer ia indebted to C. C. Craig, Einar Hille, Dunham Jackson, and J. Shohat for 
helpful ctiticnl reviews of the preliminary draft of this paper. 

* G. H. Hardy, J. E. Littlewood, and G. Pdlya, Inequalities (Cambridge University 
Press, London, 1934), pp. 12-16; luid Nilan Notris, ^ ‘Inequalities among Averages," Annafs 
of Mulhcmalical Slalislics, Vol. VI, No. 1, March, 1936, pp. 27-29. 

* Jules Bihnaymd, Sociit^ Philofiiatique de Paris, Extraits dea proc^s-verbnux des sdauces 
pendant I'annde 1840 (Imprimerio D'A. Ren6 et Cie,, Paris, 1841), S6anco du 13 juin 1840 

p. 68. 
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type bias is changing at an increasing or a decreasing rate as between the un- 
limited number of averages available for use. Considerable interest attaches 
to u(0j the weighted sample form of function. 

Let u{t) be made arbitrary for the case of n - 2, with xi = 1, and Xz ^ 
where X is any real number. Also let Ci - a, and d = fi, where a + j8 = 1. 

TUeiio>(0 = [a -h Now for all values of i, . 

For 1 1 1 sufficient/y small, it follows that 

log (« + -f i ^X' (1 - 0 + /3X’ 1^-, i + I ^ ^ 

so that for i 7^ 0 

1 log (« + = -/3X-h|^X^l -/J)f + i9X’’ 1^-1 4-^- . 
Therefore w(i) = exp, log (a + j 

= e-''|^l + i|3\"a -0)« + /3X*|-i + |-| + i^(l - •••]. 

It follows that u"(0) = 2|3XV'“ + |3)‘xl. It is clear 


L"0‘*'2"3 + 8^^^ 


It is clear 


that «(0) is the weighted geometric mean, and that 0(0) is the unit weight or 
simple sample form of geometric mean. As a means of demonstrating the range 
of values which u"(0) may take it is helpful to rewrite the expression for «^^(0) 
as follows: 


a»"(0) = 


= i^Ui-^)V[x-|^j] 


«■" = }i\ dl 


This consideration makes it possible to distinguish three cases of y = /(X, j3) 
for fixed /3j namely, 0<;3<^;/3 = ^; and | < j3 < 1. In all three cases 
/(X, 0) has an absolute minimum ^t(j3) S 0, and /i(^) = 0. The corresponding 

values of \ satisfies the quadratic equation X^ - 1 i X + ^ ^ 0. 

V 6 pU — pj p (.1 — p; 

It is clear that by taking 0 near enough to 0, one can make ju(P) as large negative 
»as is desired. Also, by choosing X properly, one can make ^^-ke any 
value between fi(0) and <» . For example, when a = = |, X may be selected 

so as to make any arbitrarily chosen non-negative number. For then 
X* 

® ^ fi’id as X increases from — « to 0, w"(0) decreases from to 
64 

0, If X = 0, w^'(O) = 0' If X > 0, os X increases from 0 to 8, increases to 
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64e”*, and as X increases beyond 8, «''(0) decreases, approaching 0 as X increases 
indefinitely. It is evident that the case of a =* with X = -log 2, = 1, 

and 2)2 = is one in which wW becomes the unit weight or simple sample 


type of generalized mean value function, namely, ^(i) 



Reference 


to the first expression above n( 
\pi in this special cose. 


for will make clear that ^"(0) = 


Analysis of $(i), the generalized integral form of generalized mean value 
function, makes it possible to characterize populations of a very general char- 
acter, as well as samples, But in the case of $(<) it is even more difficult to 
generalize as to convexity properties. For example, let 


m 






dE(u) 


where 






This expression is obviously of the required generalized integral type. Now 





* e' 

Therefore $(i) - and $"(t) =» — > 0 for all i, That is, in this particular 

case, $(t) has only one horizontal asymptote. 

The foregoing examples indicate that the following conclusions may be drawn 
as to the diverse convexity attributes of the various means as functions of t: 
(1) The unit weight form, and the weighted sample form, u(t), must always 
have a point of inflection, since both of them not only increase with t, but are 
doubly asymptotic (have two horizontal asymptotes). (2) Points of inflection 
for 0(0 and w(0 do not necessarily occur at t ^ 0. (3) The generalized integral 
form, $(0, need not always have a point of inflection. That is, the second 
derivatives of certain forms of <^(0 do not change their sign, since such forms 
are concave upward. 


Urivbrbitt op Michiqatt. 



A SIMPLE FORM OP PERIODOGRAM 


By Linsmoee Alter 


Schuster's introduction of a method of systematic search for hidden periddici' 
ties and cycles opened a new field for the investigator of statistical data. The 
beauty of his method in its analogy to analysis of light, and the great reputa- 
tion of its author, combined to give it universal acceptance and to blind statis- 
ticians to its faults. I 

In more recent years at least three new mathematical and two mechanical 
forms of periodogram analysis have been proposed, each of which exhibits 
certain advantages over the original one. The use of the term periodogram 
for these forms is an extension of Schuster’s original definition which used as 
abscissae quantities proportional to the squares of the amplitudes of the sine 
terms found in the data for the various trial periods, He wrote: "It is con- 
venient to have a word for some representation of a variable quantity which 
shall correspond to the spectrum of a luminous radiation. I propose the word 
periodogram and define it more particularly in the following way: 


'ii+r ru+T 

Let hTa I f(t) cos kidt and iTb - / J(t) sin kidt 
hi Jh 


where T may for convenience be chosen equal to some integer multiple of , 


k 


2 ^ , 

and plot a curve with as abscissae and r ~ ordinates; this curve, 

K 


or better, the apace between this curve and the axis of abscissae, represents the 
periodogram of 

The following appear to be the essential criteria for a satisfactory form of 
periodogram: 

1. It must exhibit plainly any repetition of form in the data regardless of 
how irregular the 'shape of the repeated interval may be. In doing this it 
must exaggerate the amplitude of the main terms at the expense of the 
lesser ones. 

■2. The calculation of the indices must be short. In a periodogram from 
many data the indices sometimes are computed for several hundred trial 
periods. 

3. There should be a geometrical interpretation of the index used,, 

4. The frequency distribution of the index must be known. 

6. Combining or smoothing the data should modify the index in a manner 
which leaves an obvious interpretation. 
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The Schuster periodogram has the following disadvantages; 

1. Only sine terms of large amplitude arc exhibitfcd. A perfect repetition 
of an extremely irregular form of data would not be indicated in any way. 

2. The calculations are long. 

3. There is a considerable uncertainty in the length of the period found. 
Those methods of analysis which use harmonics as well as the fundamental 
have much less of this uncertainty. 

The correlation periodogram has advantages in each of these points over the 
Schuster. However, even ;vith it the eafculations are fairly long. Furthei- 
more, the modification of the coefficient introduced by grouping or smoothing 
is not a linear one, 

The periodogram described here is a slight modification of one for which a 
preliminary note was published in 1933.. Additional features have been studied 
and its applications to many data have shown its ease of calculation. This 
calculation has been reduced still more by a mechanical method which renders 
it practicable to contemplate the possibility of studying many data hitherto 
prohibited by excessive cost. 

Consider data ato , , a;# , • • • a:,- , - ♦ * Let I be any inte^r less than n. 

Form the sum of the absolute values of Xi ^ designated by 2 | 


Define A - 2 i takes the values of the various trial periods and 

1-1 n ^ I 


is called the lag. A, therefore, is the mean error between prediction that data 
will be repeated after a lag of I and the fulfillment of the prediction. Such 
an index has a meaning that is immediately of use to a meteorologist or other 
investigator, Coefficients such as the Schuster and the correlation coefficient, 
although valuable statistically, are of less immediate interest. 

The standard deviation of these errors of prediction follows at once from 
standard formulae under assumption of normal distribution. 


(T = 1.25 A 


The distribution of <?•, as computed from the absolute values of data, has 
been studied by Helmcrt and by Fisher. Davies and E. S. Pearson have com- 
pared the various methods of estimating <r. For the large number, (n — 1), 
pairs of data used for a periodogram point, this method becomes almost as 
precise as the usual one which would square 'the values of — Xi-i). For 
{n — Z) as small as 60, the standard deviation of the standard deviation by this 
method is only seven percent larger than by the other one, Fisher has shown 
that 

-s/n— I 

This may be written as 

1.068 tr 

— 1) 


V 


TT ^ 2 


as (n “ Z) — > w 
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Tlie distribution approaches normal rapidly and for all values of (n - i) that 
would bo used in periodograin calculation certainly may be considered as normal. 
It will be very seldom that a value of (ti - 1) much smaller than 200 will be 
used. 

i 

The data may be printed on two strips of adding machine tape held together 
by clips so as to match data separated by a lag 1. In arranging them for investi- 
gation, it usually is most convenient to make all numbers positive. The 
computer subtracts mentally and puts the difference into an adding machine, 
which gives him A almost immediately. 

For some computers, and especially where the numbers are large, another 
method of obtaining A may save time or lead to less numerical mistakes. The 
computer will form the sum of all his data. He will, as for tlic other form of 
computation, put these on two pieces of adding machine tape that he lays side 
by side. However, instead of putting the difference of the pairs into the ma- 
chine, he will, in each case, put in the smaller datum of the pair. Then, 

(n - 1) A I = 2 S all data - [2 1st (n - 1) + last (n - 1) data] 

— 2 smaller 

The derivation of this equation is obvious. In computing by this method the 
subtotaler on the machine can be used to make the strip of sums of the first 
{n - 1) data and of the last {n - 1) for all values of 1. The first term on the 
right hand side is a constant, the last is twice the sum of the smaller numbers 
chosen in the pairs. I have computed by both methods, and where the numbers 
are small, I prefer the former. Where they arc large, I prefer the latter. How- 
ever, when one must use comparatively untrained computers, lie will find less 
mistakes made if the computer docs not make the subtractions. 

The calculation of A is much shorter than that for the indices even of the 
correlation and variance periodograms. It may, however, be shortened even 
more by a mechanical arrangement, (n - is the area between two histo- 
grams of the data matched after a lag 1. These may be carefully graphed on a 
large scale and two such graphs superposed over a table with a translucent 
illuminated top. On the edge of this tabic is the track to guide a rolling pla- 
nimeter. A, as computed by this means, is accurate to approximately oue-half 
of one percent of its value, a much more exact value than is needed. The 
details of such a device as constructed for the Griffith Observatory are shown by 
tile accompanying photograph and diagram. The dual sailing of time by fclie 
method and by its mechanical application have resulted in the adoption of a 
much more ambitious program of meteorological research than previously was 
contemplated. 
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flF'riTa 

, rT,r»jtt4,uB4 A»|kl1 


ScAi/E Diagram of Planimeteu Device 



Planimeter Device for Mechanical Calculation 
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The form taken by the peiioclogram is important. Consider tlic simplest 
case, data which follow a sine curve. 


yi = a cos 


iji - iji-i = 2a sin — 

P 



The term in brackets takes values distributed around the circle and the psirt 
outside is a constant for any one lag. The bracket term sums approximately to 


2(n — 

TT 


since we consider all terms as of one sign only. 


A 


I — 


4a . tI 
— sin — 

TT J) 



If the absolute values were not considered in the expression for A i , the periodo- 
gram would be a sine curve of period 2p. The lack of sign gives a cusp curve 
with the cusp at lags p, 2p, etc. Such a form is advantageous in that the 
periodogram gives sharp peaks at multiples of the periods which may exist. 

The effect of the periodogram in exaggerating the principal terms at the 
expense of the smaller ones may be obtained most easily by equating <r as- 
obtained by the linear and the quadratic formulae. 
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The data may be written as the sum of cosine terms 

( 2irl ^ »o \ , T f^Tri — 


-f ' ■ • + C, 


Vi - Vi-i 


2a sin — 
Va 



3^(2 i ip ^ 

Vo 


+ • • ■ + (C( — Ci-j) 


2 ivi - yt-iY - 3(7i - iW sin^ — + 2(n - Ofc* sin® {n - 1) ■\/2 ol 

Pq Pb 

The sine terms contribute to A] in proportion to tlio squares of their ampli- 
tudes. On account of the sin® — factor, they contribute very little to values 

Vi 

irl 

of ill for ^vhich — is not very closely an even multiple of v. 

Pi 


This method has been applied to rainfall data of the Pacific Coast and has 
proved as satisfactory in practice as would be expected from the simplicity 
of the theory. The peiiodogram of rainfall stations along the northern third 
of the California coast is shown here, exhibiting perhaps the most definite 
single piece of evidence over found for rainfall cycles. Outstanding is a cycle 
of about 45 years with its fourth harmonic as the secondary feature, The 
writer expects to publish the results of that work in the Monthly Weather 
Review, 
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ON CERTAIN DISTRIBUTIONS DERIVED FROM THE MULTINOMIAL 

DISTRIBUTION ‘ 

By Solomon Kxjllback 


1. Introduction. With the multinomial distribution as a background, there 
may be derived a number of distributions which^are of interest in certain prac- 
tical applications. Several of these distributions are here presented and the 
theory is illustrated by specific examples. 


2. Preliminary data. In the discussion of the distributions to be considered 
there arc needed certain factorial suras whose values are now to be derived. 
In the following discussion only positive integral values (including zero) are 
to be considered. 

There is desired the value, in terms of iV, n, r, of 



Mn, N) 



N\ 

Xi\ Xi\ " • aPnl 


where the summation is for all values of xi , ica , • • • ,Xn such that xi + atj + • ■ ■ 
-\-Xn = N and no x is equal to r. . 

Let us first consider the case for r = 0; i.e., we desire a value for the sum in 
(2.1) for all values of Si , aij , • " , such that aii + % -f- ■ * • + a:n = iV and 
no X is equal to zero. By the multinomial theorem, we have that® 


(2.2) 


(tti ffla “h ' • • “b “ ^2 


N\ 


‘ x„] 


aV 


a 


zn 

n 


where the summation is for all values of a:i , a:? , * ■ * , such that aji H- ■ 

Xn — N. If ni = ^2 = * • • — o„ = 1, then 


(2.3) 


n 


"=E 


jvi 


Xi -|- X2 "h ■ • • -j- asn ~ AT. 


3Ji! iCal • ■ ’ ®n! ^ 

The sum in (2.3) may however be rearranged into the sum of a number of 
terms as follows : 

N\ 


n 


(2.4) 


a:il 2 : 2 ! •• • iCnl' 

p Nl 


n(n — 1) 


N\ 




a:i + X 2 * ' • + JCn IV, no X - 0; 
xi 4- xj d- • ■ ■ -j- x„_i = N, no X = 0; 

, Xi + X 2 -f • ■ ' -f = no X =s 0; 


0 


ivi 

r J ^ xjxsl “ ■ Xn^J' 


xi + *2 + ” ■ + x„„r = N, no I = 0. 


1 Presented to the Institute of Mathematical Statistics January 2, 1036. 

» H. S. Hall & S. R. Knight, Higher Algebra, MacMillan & Co., 4th Ed. (1924), Chap. 16. 
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Thus we may rewrite (2.3) as 

n" = hin, N) + nfo(n - I, N) 


(2.5) 


nin " 1) * / 

H 2! — 


— 2, JV) + ’ • * + — »■, iV) -h • • • 


Keplacing ti by w — 1 in (2,6) there is obtained 

+ (n - l)/o(* - 2, JV) + ••■ + (” 7 - T ~1,N) + •• • 

Multiplying (2.6) by n and subtracting the result from (2.5), there is obtained 
- nin — 1)^ = Join, N) 

(2.7) r(^;J^j)A(n-r-l,N)- ... 

Replacing u by u — 2 in (2.5) there ie obtained 

, {n-2f = Mn-2,N) 

(2.8) /« _ 2\ 

4-(w-2)/,(n-3,Ar)+ ... +(^”_^j/o(7i-r-l,JV)+ 

Multiplying (2.8) by n{n — l)/2 and adding the result to (2.8), there is obtained 

_ D" + 2(2^ („ _ 2)'' = Mn, N) + 

(2.9) 

Uin - 3,N) + ... + (^ ™ - r - 1, W) + .. . 

Continuing this process, there is finally obtained the result that 

(2.10) /,(r, JV) = n'' - n(ti - 1)" + ^ (71-2)" ± »• l" 


It may be shown^ that the right side of (2.10) is A”x^ for a; = 0. The author 
has elsewhere obtained (2.10), but by a special procedure not applicable to the 
general case.* 

We may readily verify (2.10) for example, for w = 3, M = 6. If Xi + a?2 
+ 2:3 = 6 and no a: = 0, then the sets of solutions are (3,1,1), (1,3,1), (1,1,3), 

(2,2,1), (2,1,2), (1,2,2), and/, (3, 6) = 3.gi||ji + 3^j|L.= 160. From (2.10) 

there is obtained /o(3, 5) *= 3® — 3,2® + 3.2/2 = 150. 


’ E. T. Whittaker & G. Bobinson, The Calculus o/ ObservationSj Blackio & Son Ltd. 
(1924), p, 7. 

* 8. Kuhbaok, “On tho Bernoulli Distribution, " Bull. Am, Malk. Soc,, December, 1936. 
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For the goneral case, wo return again to (2.3) and rearrange the rJglit side 
into the sum of a number of terms as follows: 





( 2 . 12 ) 


vT - /rCn, N) + '^frin -hN-r) 


+ — 21(r i )« ' -■%N~2r) + 


where at'*' = N(N - 1)(JV - 2> • • ■ (W - i + 1). 

Replacing n by n — 1 and Nhy N — r in (2.12) there is obtained 

(2.13) 




r! 


/r(n - 2, - 2r) 4- 


Multiplying (2.13) by 
obtained 


7 ^ 

rl 


and subtracting the result from (2.12), there is 


(2.14) n" - ^ (n 

Ti 


!)''■'= /r(n,JV) 


nCn - 1)JV“'’ 
21 (r 1)2 


f,(n-2,N-2r}- 


By continuing this process, in a manner similar to that used for the case r = 0 
there is finally obtained 


/,(n, N) = 

(9.16) 


nN' 


(r) 


r\ 


(n - 1)"“' + 


n(n - 1)N'“ 
2!(r!)2 


(n-2) 





(„ - 3 )"-'' 


+ 


By setting r = 0 in (2.16), there is of course obtained the value already 
found in (2.10). 

We may readily verify -(2.16) for example, for w = 3, W = 5, r =s 2, If 
Xi -i- Z 2 + Xy = 5 and no « = 2, then the sets of solutions are (5,0,0)j (0,5,0), 
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(0,0, B), (4,1,0), (1,4,0), (1,0,4), (4,0,1), (0,1,4), (0,4,1), (3,1,1), (1,3,1), (1,1,3^ 
and/j(3,6) = 3-6I/61 + 6-6I/4! + 3-61/31 = 93. From (2.16) there is ob- 
tained /aO, 5) ^ “ 3-6-4-2V2! + 3-2*5-4-3-2/21{2l)® = 93. 

The same method of procedure may be applied to evaluate 


(216) 


Thus, there is derived the result that 


aJi + -f* ' - • + a!n == iV^, 

no a: «r,s,- 


(217) 


, JV) = «*_«(- 

2l(rl}“ 


AT'^Cn - I)"-- , Ar‘‘’(« - 1) 


+ 

+ 

+ 


+ 


4- 


sl 




- 2 ) 


-V— r— I 


2! (Jl)> / 


»(n — 1)(» 


(rl) (,!) 

«'’(« - 3)"-*’ 


- 2 )(« 


31 (r 1)2 


or f. 




2l(H)2(sl) 


21 (rl) (s!)2 


+ 


31 (s!)2 


) 


We may readily verify (2.17) for example, for n = 3, W » 5, r 0, s *= 2. 
If + xj + * 5 and no a; = 0 or 2, then the sets of solutions are (3,1,1), 

(1,3,1), (1,1,3) and /os(3;6) - 3‘5l/3l 60, From (217) there is obtained 

/m(3,5) = 3“ - 3(2' + 6 - 4 . 272 ) + 3-2(1/21 + 6-4/21 + 6-4-3-2/(2l)^) = 60. 
It will be shown later (see section 8) that 


(0) 


/,(n, N) = Un, Af) + ^ frM -i.N-t) 


(2.18) 


, n(n - l)iV<'"> , , „ „ „ X , 

21 (sij* + 


(2.19) 


/.(», W) - Un, ^0 + ^5^’ /r.(« - 1. 2V - r) 


21 (r!)® 4" 


From (2.18) and (2.19) there may be derived, by 
employed in deriving (216), that 

/,.(«, N) = Mft, N)-^f,(n-\,N-s) 

( 2 . 20 ) 

nCti’ — l)jPf ® 
21 (s\y 


a method similar to that 


■) 

-/,(»t-2,JV-2s)- 


Thia latter result also follows from (2,17 and (2.16). 


t « I 
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Let us now consider the following generalization of (2.1). There is desiretl 
in terms of iV', n, r, ai , aa , • ■ • , a„, the value of 


(2.21) Fr(n, N,ai,aa, ■ ■ • , On) = S 


Nl 


iCll Xal • ■ ' JEnl 


fli* fla" ' • • a„ 


where ai j aa , • • • , a,v , are constants and the summation is for all values of 
Xi,X 3 , * * ' , such that .Ti -H aja + • ■ • + rtn - iV and no a: = r. The method 
of procedure is the same as that for the case already considered, viz when 

ai = ct-s = • • • = dn “ !♦ 

The sum in (2.2) may be rearranged into the sum of a number of terms as 
follows : 


( 2 . 22 ) 


iVl 


xd X 2 ! ’ • ■ 

Nl 


ad flj* * ” ftn", xi + xa + • * • + Xn = iV, no X = r; 


_r 

rl • Xnl 


as* ■ ■ ■ En'* + • * • + ^ 


N\ 


ad ' • • ad'i , 


rl ^ Xi\ • ” 
xi + X 2 + • ■ ‘ + x„_i = N — r, etc., no x = r; 


a\ • al 


N\ 


- • • a7 + 




(rl)* 


X)t+ll “ ' ‘ x„l 


+ 


Ctyi— jt+1 * * * 


N\ 


ad ‘ 


(r!)* 

xi + xs + * • ‘ -h - N — kr, etc., no x = r; 


For convenience, let us write 

A(n, N) = (oi + as + • • * + On)'^ 

Ai{n — 1, JV) = (ai -H ■ ■ • + a,'_i + aj+i + • • ■ + a„y 

Aij{n — 2, JV) = (ai + h cn-i + ar+i + h a,_i + a,+i H + 


(2.23)^ 


Orin, N) - Fr{n, N,ai, at, * • ■ , a„) 

Grin - 1, N, di) ~ Frin - 1, JV, ai, os, > • • , aj-i, a,'+i, • ■ - , a„) 

Grin — 2, a,-, a/) = F,in — 2, Oi, • • • , a<-i, aj+i, • • ■ , a/_i,o,-+i, • • • , 0 ^) 


so that (2.2) may be written as 


(r) n 


(2.24) 


il(n, N) = Grin, ^ E o! a(« - 1 , JV - r, a,) 

ri i— 1 

n 

+ ^r(n -2,N- 2 r, m , o/) + 


(t j, etc.) 
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From (2.24)j there are obtaiaed n eqxiatLona 


(2.25) 


Ai(n ~l,N-r)^ 0,(n -l,N -r, a,) + ^ 

T’l 


trl 


- 2yN - 2r»Oi,aj) + 


(i := U 2j ■ ■ • , 71, jV 1) 


Multiplying (2.25) by a!'iN^^^/r\ and subtracting the result from (2.24), there 
is obtained 


A{n,m - E -^Un ~ 1, W - r) = 6,(«, W) 


(2.26) 


i:=i 


r! 


iV 


{ir) n 

2 a\a)Gr{n — %N — 2r, a,-, a,) — 


21 (r!)“ ipi 


(i j, etc.). 


Continuing this procedure, there is finally obtained 


N 




(2.27) 


6*, ( 71 , JV) = iV, oi, flj , ■ • ' , tfn) ~ A(n, N) - 

- lyN ~r) + Yj aUiAain -2,N - 2r) - 

i-l "I V U 


{i jf etc.) 


Similar results are obtainable for 

(2i2S) (?ra . I <( ^ N f (Jl , Q 2 7 ' ' " ; ®n) ~ 


iV^I 


adiCjl ' ■ ‘ ainl 


Hi* Os* (fn 


where the summation is for all values of .'i;,- such that Kj -I- 3^2 + 
and no X ~ r, s, • • • , or t. 

Thus, it will be shown later (see section 8), that 


"h — ■^7 


(2.29) 


, wO) n 

Grin, N) =f Griin, N) -{ r- 2 - If JV - 5, at) 

S I 4^1 

^12») » 

2!!^* 2,iV ^ 2s,ai,a,-) + ■ • - 


(i ^ j, etc.) 


Corresponding to the derivation of (2.27), there is obtained from (2.29) 
the fact that 


(2.30) 


TyO) fl 

e„(n, W) =i (J.(n, ff) _ " 2 a; (?,(«- 1, W - s, o,) 

SI I 

^(2i) n 

+ 2 ahjGrin - 2, iV - 2s, fli, a,) - • • 


21 {a\y 




(i 7 ^ j, etc.) 


3. The problem to be studied. Consider a trial in v^hich one of n mutually 
exclusive events may occur, with the respective probabilities of occurrence 
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Pi f Pi j " j pn where }?i + P 2 + ■ • • + ?)„ — 1. The probabilities of the 
various combinations of events wliich are possible 111 N trials arc given by the 
terms of the expansion of (pi + Pa + • * ■ + Pn)^. 

In the N trials some of the possible events may not occur, others may occur 
one, twice, etc. It is desired to study the distribution of the number of events 
which do not occur; the distribution of tho number of events which occur once 
each, etc. The simultaneous distributions of the events above described are 
also to be studied. 

For example, the possible event may be the occurrence of a digit. A study 
of a sequence of random digits, in sets of ten, yielded the following three 
sample sets. 


b 

1 

2 

CO 

4 

6 

6 

7 

8 

9 

1 

0 

2 

1 

1 

2 

1 

0 

0‘ 

2 

1 

1 

1 

1 

1 

1 

2 

0 

1 

1 

0 

0 

2 ; 

1 

2 

1 

2 

1 

0 

1 


Fig. 1 


In the first set three events do not occur, four occur once each, and three occur 
twice each. In the second set one event does not occur, eight events occur once 
each, and one event occurs twice; etc. 


i. Distribution of the number of events not occurring. To obtain the distri- 
bution of the number of events which do not occur, there is applied to the 
expansion of (pi + pa + • • • + Pn)^ a procedure similar to that employed 
in section 2. 

Thus, if iTro represents the probability for r events not occurring, then 


iroo 


TTIO 






iVl 


Xil iCsl • • • Xji\ 

fvl 


vVpV ■ • • Pn") .11 -|- 3^ H- ■ ■ ■ + aCn = JV, 

no a; = 0; 


(4.1) 


X2\ 


pa’ ■ ' • Pn" + h S 


m 


\vV ”■ vl-i, 


"" ■ ■ ^a:i!..-a:„_i! 

+ aca + ■ ■ ■ “[- a:«_i = N, etc., no s - 0; 




= 2: 


N\ 


N] 


\VV ••• V‘n~r, 


— j — —I p^+V • * • p*" + ■ " + S 

Sr+ll ' ' ' ^n' ' 3/n"-r* 

a:i + asa + ■ * * + aJn-r - JV, etc., no = 0; 
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Employing (2.21), we may write (4,1) as 

ITOO “ I Pi j Pi > * ' * > pO 

(4 2) ^ ~ ~ ' ' * ' • ■ ' + ^a{n — 1, iV, Pi , p2 j • • • , Pn-i) 

JTri - ii'o(7^ “ T, N, pr+1 , ■ • ■ , pO ' ■ • + ~ ‘ , Pn^r) 

Since pi + Pi + ■ * * + p« — 1 there is foimd from (2,27) that 

ffoo = 1 - £ (1 - PiY + i 2 (1 - Pi ~ P/)^ 

<-i A I i,j-i 

- ^ Z (1 - Pi - P; - p*)" + • • • 
irio = E (1 - Pi)"^ - Z (1 - p.— p,/ 

t“i fij-i 

+ ^ Z (1 ' Pi ’ p,- - p*)"' — 
2! 


(4.3) 


Trig 


= ^1 ( Z (1 - p< - p/)^ - Z (1 ^ Pi - Pi - Vkf + • • • 


TTSO = ^ I Z (1 - Pi ^ Pi ^ VhY -'■■*} 

dl (i.i,k-l J 

5^ i, etc.) 

The factorial moments* of the distribution given by (4.3) are easily derived. 
The first factorial moment is given by ffi = -n-io + 2ir2o + Sirso + ■ • • + riTro -!-■•• 
^nd the summation of the proper terms in (4.3) yields 


(4.4) 


<^1 “ Z (1 “ p<)^ 


i-1 


In general^ the r-tli factorial moment, given by ^ — 1) * • • 

fc-r 

(/c - r + :)7rA< is 

(4.5) Cr~ Yj (1 - Pa - Pi - • ■ • - Pr)^ (tt 5, etC.). 

-ir—l 

Indeed, (4.3) illustrates the fact that, if J{x) is the probability that a. discon- 
tinuous variate takes the value a;, then® 


m 


a:! ifc~a 


* J. F, Steffensen, Inlerpolalion (1927), p. 101. 

* J. F. Stefienften, “F&otoriai MomentB and DiBcontinuouB Frequency Fianctione" 
Skandinaviak AktuarieiidBkrifl, Vol. VI (1023), pp, 73-89, 
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The moments about any constant of the distribution given by (4.3) may bo 
derived from tho factorial moments by the relation’ 

(4.7) E{x - ay = (1 + ffiA + (r2AV21 + • • ■ + (t = -a) 

where A ia the difference operator of the calculus of finite differences, and 
is replaced by (—a) after the indicated operations have been performed. 

Of special interest ia the case when p, =: pa = * • • =. pn = - , f or which (4.3) 

n 

becomes 


(4.8) 



whore fo(n, N) and A^O'^ are as' defined in section 2. The probabilities in (4.8) 
are the respective terms of the expansion of 

For this case the r-th factorial moment becomes 


(4.9) ffr = n(n — 1) (n — r + 1) (n - tY 

There ia presented an example of the distribution (4,8) for the case n = N = 10 , 


It is found that® 






'ao'" 


1 

A®0*" := 16436440 



= 

1022 

A^O^” = 29635200 

(4.10) ^ 


= 

55980 

A®0“ = 30240000 


a'o'^ 


818520 

A’O”' = 16329600 


^aV 

= 

6103000 

a'^O”* = 3628800 


/ 

TTOO 


.000362880 

ttbc “ .128596600 


ITlO 

= 

.016329600 

TTeo == .017188920 

(4.11) 

Wzo 

= 

.136080000 

fl-To = .000671760 


W30 

— 

.356622400 

TTso = .000004699 

1 


= 

.346144240 

ireo = .000000001 

(4.12) 

fcri = 

3.480784401 

m = 3.486784401 

\lT2 = 

9.663676416 

(t’ = 0.992795368 


^ This result is derived aa follows: (x — o)'' — (1 + A)*.(~o)'’; £(a: — a)' ^ {x — ay 

Ki) » (s (1 + (-«)^ = (1 + + x(» - 1)AV21 + ■ - ^/(x)^. (-a)'. For 

iv bivariate distribution it may bo shown similarly that, symbolically, ^f((x — — l>)') 

*= [oxp(ffi. Ai + <r.i Ai)) ■ where - ffmn and Ai operates only on a and 

operates only on b. A similar result may be derived for a multivariate distribution. 

• cf/ Whittaker A Robinson, op. oil. p. 7. 
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The observed distribution was obtained by distributing 200 sets of ten digits 
each, the digits being found in Tippet's Random Sampling Numbers.® The 
results obtained are given in Fig, 2, Three of the 200 observed sets wore 
illustrated in section 3. 

The agreement between observed results and theoretical values is gratifying. 


5. Distribution of the number of events which occur once each* Let , 
represent the probability that there are k events which occur once each. Thus, 
the various probabilities, obtained by rearranging the terms of the expansion of 
(pi + 3?a + ■ • • + V»Y} as follows: 


TToi =? 


TTu ~ 


£ — i pV + • ' ■ + = N, no .r = 1; 

ftJll ‘ * Xjt 


aJi + + ■ ■ ■ -b ®n-i - N — I, etc., no a; =: Ij 


AH 


(5.1) 


WJtL = Pl'Pl • • ' pfe £ 


m 


• Pn + b Pi«-fc+l • ■ ■ Pn 


r.»n 


m 


ajfc+i! ' • • .'Prtl 

— * i Pi Pn-kf 

XO'" X„-)>\ 

I 

Xi -b *2 -b • “ + - N - k, etc,, no X =* 1; 


No. of events 
not ocoairlng 

Observed 
frequenuy ' 

Theoretioal 

frequency 

1 

3!(X-1)/ 

Observed 

parameters 

0 

0 1 

1 0.G8 

0 

0 

1 ffi = 3 , 46 

1 

8 

3.26 

8 

0 

' 0^2 = 9,61 

2 

■22 

27*22 

44 

44 

X ^ 3,46 

3 

1 72 

71.12 

216 

432 

^ 1.0984 

4 

72 

69.02 

288 

864 

Theoretical 

5 1 

21 

25.72 

105 ' 

420 

Parameters 

6 

4 

3.44 

24 • 

120 

ffi = 3.49 

7 

1 

0.14 

7 

42 

fji s= 9.66 

8 

0 

0.00 

0 

0 

m — 3.49 

9 

0 

0.00 

0 

0 

= 0,99 


200 

200.00 

692 

1922 



Fig. 2 


* L, H, C. Tippet, Random Sampling Numbers, Tracts for Compulerc, No. XV (l&S?), 
London. 







DISTRIBUTIONS DERIVED PROM MULTINOMIAL DISTRIBUTION 


137 


In view of (2.21) and (2.27), it is found that (5.1) becomes 
fTToi = 1 - S P,’(l — p<)^“^ 4- T! P.'D,(l - 


= l~Nj^ p,’(l - p<)^ ^ 4- 2 - Vi - ViY * 


(5.2) 


1^11 - P<(1 - PiV ^ - (N - l)^^^PfP;(l -J>i- P,)^ ^ 4- • • ■ 


TTil = 2| < P(P,(1 - Pi “ p,‘) . y 


{i 9^ jf etc.) 


From (5.2) there is readily derived the fact that 

V, = iV(i\^ - 1) - . . (iV " r -h 1) 

(5.3) 

X) PaPi • ■ ’ Pr(l - Pa - Pi Pr)^"', (» 5^ eto.) 

a,6r ■ ►iT"! 

For the case in which p^ = pi = . ■ ■ = the distribution in (5.2) 

becomes 


iTii = Q) fi(.n, N) 

irii = nNfiin — 1,N — 1) 


CM) ^ ny ^ 


- r. iV - r) 


where /i(«, JV) and have been defined in section 2. For this cose (6.3) 
becomes 

(5.6) o-r = - rY~'/n^ 

Evaluation of (6.4) and (5,6) for » = iV = 10 yields, 


(6.6) 


TToi - .00811639 

TTu = .27052704 

Tsi = .01632960 

TTii = .04794633 

T6i ^ .16621984 

Tn = .00000000 

TTsi = .14082336 

X8i = .12700800 

TTioi ^ .00036288 

irai = .21089376 

TTTi = .02177280 



3.87420489 
= 13.58954496 


m = 3.87420489- 
:= 2.45428632 ' 


For the oaee n = iV = 10 there cannot be 0 events occurring once each, eince then the 
tenth event Tnuet also occur onoe. 
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' The observed distribution, given in Fig, 3, was obtained from the 200 sets 
previously considered. 

The agreement between the observed results and theoretical values is 
gratifying. 

6. Distribution of the number of events which occur r times each. Let 
TTfcr represent the probability that there are h events occurring r times each. 
Thus, the various probabilities, obtained by rearranging the terms of the ex- 
pansion of (pi + 332 + * ■ * + Pn^, are na follows: 


No. of eventfl 
occurring 
once oacn 
£ 

Observed 

frequency 

f 

Theoretical 

frequency 

xf 

a!(x-l)/ 

Observed 

parameters 

0 

1 

1.62 

0 

0 

ffi 3.906 

1 

10 

1 9'. 58 

10 

0 

9i = 14.000 

2 

30 

28.16 

60 

60 

X = 3.905 

3 

37 

42.18 

111 

222 

s® ^ 2.656 

4 

62 

64.10 

248 

744 

Theoretical 

6 

27 

31.24 1 

136 

640 

Parameters 

6 

22 

26.40 , 

132 

660 

(Ti = 3.874 

7 

3 

4.36 

21 

126 

(Ta 1= 13.690 

8 

8 

3.26 

64 

448 

3.874 

9 , 

0 

0.00 

0 

0 

.js »= 2.464 

10 

0 

0.08 

0 

0 



200 

199.98 

781 

2800 



Fig. 3 


( 6 . 1 ) 




JV! 


Tlr 


= E --|777^j ‘ xi + xa d - 1 - = JV', no x = r ; 




rl ^X2\ • ■ • Xnl 


p? ■ • ‘ 




JVI 


xi -!- X 2 + ■ ■ ' + - N — r, etc,, no x = r; 


TTh ~ 


pipi ‘ ■ pit 


JVl 


(r\)^ 


P^V-*-p** + 


I Pn-Hl ■ ■ * Pn V 


N\ 


pi' ’ • ■ p*-**, 


(r!)* 

4* ®2 + ■ ’ ■ + - N — hr, etc., no x = r; 
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In view of (2.21) and (2.27) it is found that (6.1) becomes 

jy^<r) n 
^-1 




(2r> n 

L pJpKi 


21 (rl)^^ 


(6.2) { 


Tif = 


rl 


L pi(l - ptY " - S vWiiX -pi- } 

l,*-l Ti ij’^l J 


j^(2r) f n 

= 2KHrn.S. ~ ■ ■ ■ 


a j, etc.) 


From (6.2) there is readily derived the fact that 
(6.3) <Tk = 2 p»pb • * ' Pfc(l -Pa — Pb—"-- (a 5^ &, etc.) 

V"1J a.b,'--.*-! 

For T = 0,1 (6.2) and (6,3) reduce to the values previously derived. 

For the case in which pi = pa = • • • = p„ = 1, the distribution in (6.2) 

n 

becomes 


(6.4) 



where /r(«, has been defined in section 2. For this case (6.3) becomes 
(6.6) n = - fc)''‘‘7n* 


7. Simultaneous distribution of the number of events not occurring, and of 
the number of events occurring once each. The ’jirobabilities for tho simul- 
taneous occurrence of ..the various combinations of the number of events not 
occurring, and of the number of events occurring once each, are given by rear- 
ranging, the terms of the expansion of (pi + pa -1- • • • + Pn)^/ and are given 
as in Fig. 4, 

In Fig. 4 none of the subscripts take on equal values simultaneously, and (7oi 
has been defined in section 2. Summation of the values in the fc-th column 
of Fig. 4, yields the probability that there are {k — 1) events not occurring. 
Comparison with (4,2) yields 

Fi)(n,N,pi,P3, ,p„)=Oi,{n,N) = + liP<) 

(7.1) 

1^(2) n 

+ -oT p^p,(7oi(fJ - 2, JV “ 2, Pi, Pf) H , (t 7^ j, etc,) 

iSl *./-i 
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Number of evenba nob ocourrinjr 

0 

1 

r 

0 

(7oi(?i, N) 


• • • 

1 

N'£viGoiin-l,N-’l,Pi) 

i-l 

N 2 

(n-2, jy- 1, p{, Pi) 

• I 1 

2 

2^^PiPiGoi 

(n-2,iV-2, piPi) 

(ft - 3, AT -.2, p{,p,,Pk) 

• * I 

e 

* * 4 

1 

1 

t 1 1 

N<*> V 

r 1 5 

a, b,. I, Bi/S, 1 

p<tPb * ' • p«G*oi(ft — r — a, 
y-S, Pa, “* ,P„ 
1 Pf) 


Fig. 4 


, Summation of the values in the fc^th row of Fig. 4, yields the probability 
that there are (If - 1) events occurring once each. Comparison with (6.2) 
and (2.27) yields 


(7.2) 


Fi{n, NiVifpi, ’ • ■ j Pn) = Oiiriy N) = f7oi(w, iV) + ^ -- 1, JV, pi) 

<“i 

+ £ Goji(» - 2, N, Pi, Pi) , a 7^ j, etc.) 

21 <j-i 


If we use X to represent the number of events not occurring, and y the number 
of events occurring once each, then it is found that 


Eix^'^y^*^) = ff„ = 2 Vapb ' ■ ■ Pj( 1 “ Pa - ' ■ • - p* 

(7.3) a,6,‘< 

PpY~\ {a 7^ h, etc.). 

If o®Ai represents the average number of events not occurring, when there 
are h events occurring once each, then from Figl 4 there is found that ' 


2 ^ 1, iV, Pi) + 2 2 (jQiCn — pc, p,)/2l 

<-i 

n 

“h ^ 3, iV, Pi) p^, pj])/3l + ■ ' • 

(7.4) nSoi- ^ 

G(n{n, jft^) “h 2 Goii'n — 1, JV, pf) 

i-'l 

n 

+ £ ^^^(w — 2, N, pi,pi)/2l + ■ ■ • 


(,i 7^ j, etc.) 
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In view of (7.2), (7.4) reduces to 
(7.6) . (,x,i = (i; ft(n, N, pd) / ft(», m 

A similar procedure, yields, in general 

n 

S paVb • ■ • PkGiin - k - l,N - k,pa,ph, - ■ t Pk,Pi) 

(7.6) o^fci ^ 

12 PiPh • • • PkGiin - fc, iV - fc, Pa , P6, ■ * • , Pfc) 

0 , 6, >••,*=1 

(a ^h, etc,) 

If iVka represents the average number of events occurring once each, when 
there are k events not occurring, then from Fig. 4, there is found that 

wli: p.GM(n - 1, Jf - 1, p,) + 2(W - J) 


d 9^ j, etc.) 


52 PiPjGaiin - 2, JV - 2, pi, P;)/2l + • • • ^ 
(7.7) m = ^ 

G,,{n, iV) + iV £ pMn - 1, iV - 1, p <) ' 


+ 52 PiPiGnin - 2, - 2, p^)/2! 


In view of (7.1), (7.7) reduces to 


(7.8) 


(t/Lp, 


G,{n - I, N-ly Pi) / G,{n, N) 


A similar procedure, yields, in general 

N 5 ^ PaG(iin-k-l,N -l,Pt:,Pbt ‘ " tPk,Pl) 

(7.9) ipto = ^ (a b, etc.) 

22 Gain - fc, iV, po, p4 > ■ ■ • , Pfc) 

■ ’,^1 

For the case in which pi = p2 = * ’ * = pn = “ 7 as may be found from Fig. 4, 

n 

the probability for the simultaneous occurrence of r events not occurring, and 
s events occurring once each, is given by 

For this case (7.1), (7.2), (7.3), (7.6), and (7.9) yield respectively 


Mn, N) = Un, N) + nN^n - 1, - 1) + (^3 jJV™/oi(n ™ 2, 

iV - 2) + 

(7.12) /i(n, N) = fnin, N) + nfoiin - 1, JV) + - 2, JV) + * • • 

(7.13) - r - s)^’'*/n'' 


(7.10) , 
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{7.14) - k)Mn - Wfiin - k, N - k) 

(7.15) ifiio “ iV(n - fc)/o(?i - k - I, N ~ l)//o(n - k, N) 

Let us consider again the case when pi — P 2 = " * = p«. = - and n = N = 10, 

n 

Evaluating (7.14) and (7.15) by means of (2.16) yields 

” 6.71 o^M “ 3.02 

o^ii “ 5.21 oXqi ~ 2,10 

<7.16) - 0^21 - 4.61 0*71 = 2.00 

Lxaj = 4.10 o$ei — 1.00 

(^0^41 = 3.28 flXsi 0,00 


ipoc ~ 10.00 ij/eo — 1.83 

ipio - 8,00 ifffio - 0,89 

(7.17) ■ 1020 = 6.16 1^70 = 0.27 

i&so = 4.50 lysa — 0.02 

,jSoo = 3.05 ' 1^20 = 0.00 


The 200 sets of observations already considered yielded the simultaneous 
distribution given in Fig. 5. 



Fiq, 5 
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The distribution in Fig. 5 yields an = 11.89, (7.13) yields an = 12.07959552. 

The agreement between the observed results iu Fig. 5 and the theoretical 
values in (7.16) and (7.17) is gratifying. 

8. Simultaneous distribution of the number of events which occur r times 
each, and of the number of events which occur s times each. The probabilities 
for the simultaneous occurrence of the various combinations of the number of 
events which occur r times each, and of the number of events which occur s 
times each, are obtained by rearranging the terms of the expansion of (pi + 
-}-'■■+ Pn)^. If TTjir.n is the probability for the simultaneous occuiTence of 
h events which occur r times each and I events which occur s times each, then 

^(kri-ls) n 

“ )■- 1 ? I ^ ^ Pa ' ' * PfcPtf * * ' 

(8.1) AGI n (r!) (S!) 

(n - Jfc - Z, iV ~ - fo, Pa, - • • , Pfc, Pa, • ■ ' , Px), (a 6, etc.) 

where C?„ is defined in section 2. 

From (8.1) and (6.2), there is derived, in a manner similar to the derivation 
of (7.1) and (7.2), the result that 


Frin, iV, pi , ■ • ' , Pft) = Grin, N) = GrXn, N)-\- Gr.(R - 1 , - s, Pi) 

5! i=i 


( 8 . 2 ) 


r(2s) Ti 


+ 21^2 PiGraifi - 2, iV - 2s, Pi, p,) d , (kV j, etc.) 


and a similar result by interchanging r and s in (8.2), 
For the distribution given by (8.1), it is found that 


iV r r A B 

^kl “ > |\jL. /pTxi Pa ' ' ' P^Pa ’ * * PX 

( 8 . 3 ) (^ 0 ^ a.br--.k,a,^.--.\~l 


(1 ~ Pa - • • • - Pa. — Pa - • • ■ - Px) 


y— Ar—ltf 


{a ^ 6, etc.) 


If r^u represents the average number of events which occur r times each 
when there are I events which occur s times each, then from (8.1) and (8.2), 
in a manner similar to the derivation of (7.6), it is found that 


r^la — 


{N - S PaPa ' ' • - 1 - Z, y - ?• - is, pa, P«, ■ ■ • , px) 

’ ‘.X™! 


(8.4) 


r\ Yi -l,N - Is, Pc 


,2>x) 

(a ^ etc.) 
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IJ repvesents the average number of events which occur s times each 
when there are h events which occur r times each, then by interchanging k and h 
and T and s in (8.4), there is found 


{N-krY*^ 2 ''PkV^Mn-’h^ 1,1^- hr -s, 

a," •,*,«=! 


tHKr 


( 8 , 6 ) 


2 p'(i-"plO,(.n-h,N -hr,'p,,--- ,pit) 


(a b, etc.) 


For the case when Pi = Pi - ■ ■ • = pn = ', it is found that (8.1), (8.2), 
(8.3), (8.4), and (8.5) respectively yield 


(8.6) 


- (i) 


Mil (rt)'> (si) 


-j/y,(ri - k - l^N - hr - Is) 


nN 


U) 


(8.7) 


/,(», W) = /r,(rt, N) + '^Un-i,N- s) 


, Md - W”’ , , „ „ ,s, 

+ nw.l'. /»(» - 2, W - 2s) + 


2! (si)* 

(8.8) sw = /+"W"^'‘'(n - it - l)''-‘'''7(r!)‘ (s!)' n" 

(8.9) 4i, =‘(n~ 1){N ~ i8)*'’/.(» - 1 - i, JV - r - ls)/rl/.(n -l.N-ls) 

(8.10) = (n - k)(N - fo')“/r(n -k-l,N-kr- 8)/sI/,(n — k,N — kr) 

For r = 0, s = 1, the results derived in this section of course reduce to those 
already derived in section 7. 


9. Conclusion. It is clear that the same method of procedure may be em^ 
ployed to study the simultaneous distribution of the number of events which 
Occur r, 5, • • • ft, times each. However we will not continue the discussion 
any further. 

We have thus seen that the multinomial distribution serves as the back- 
ground for the study of a number of distributions which have certain practical 
applications, 

The theory discussed herein has been illustrated by several examples which 
yielded gratifying agreement between observed and theoretical results. 


Washington, D. C. 
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A PROBLEM m LEAST SQUARES 
By Jan K. Wisniewski 

§1. We are dealing with two variables^ the observed values of which are 
denoted x and y respectively. The pairs of observations are divided into r 
groups, numbering ni, 712 , • * * Wr pairs. Suppose in each group we determine a 
regression equation of the following shape: 


yi = + 6,-a; + • * ■ mix' (1) 

where yi denotes the value of the "dependent" variable obtained from the 
regression equation, while y without any subscript denotes its observed value, 
The r regression equations of type (1) are not assumed independent; on the 
contrary, we postulate that 

r 

2 =3 Oi) boa + • ■ * wiflX* (2) 

1 


be fulfilled identically in a;; Oo, bo, • • • wi© being predetermined numbers. Tins 
leads to the following conditions: 


r r r 

2 a< = og X) 6j = 60 • S mf = Wo. 
1 1 1 


(3) 


The magnitude to be minimized under the theory of least squares is now 

K “1 

I 

1 


E [l/ - (a«- + + ■ • • + Er “ 2 

+ ^60 - (ma- ^ a' 

The normal equations derived from (4) are of the following shape: 


i2 


(4) 


ri” ' ' ' Wj X 


UjO'i d" E + 5,E/a: + ^Ei>(^(Er ») 

+ (Era;*) = TiiV - ErJ/ + + &o Era; + Wo Era:* 


( 5 ) 
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flj Hi ^ + (^2 <1.-^ (Lr ic) + hj 'Hi ^ (Hr x^) 

Hi + (2 (2r = Hi xy - Hr xy+ ao Hr a 


4“ ■ ■ • w/ 


+ &0 2r +•“ mo Hr ■'c'"''^ 


(5) 


Hi X* + (H (Er x^) + hi Hi + (E &.) (Er O 
+ ■•■ mi Hi K ' + ^E (Er x^*) = Hi x^y — Hr x^y 

4 “ flo ^ JT 4^ 1^0 ^ J T X * * ' via X 


El meaning a summation extended over the Mh group. As (1) is of the 
s-tb degree, we have (s 4- 1) 0’ “ 1) parameters to determine and as many 
equations, the problem thus being in theory solved,* As to the numerical 
solution, Doolittle's method or any other may be applied. We do not enter 
at present the question, how much labor would the actual solution require. 
Examples. Allen and Bowley in their book on “Family Expenditure” 
(London, 1935) assume the expenditure on some denned item / to be a linear 
function of the total expenditure e 

f = fee -\- G. (6) 

Evidently E /c E c » 0 (efr. pp. 10-11). Another example I give in a 
paper on seasonal variation, which appeared in “Economic Studies” III 
(ICrakdw). > Actual values y of a time series are assumed to be Linear functions 
of certain “normal” values x 

y - a 4 - ha; (7) 

a and b changing from month to month but constant from year to year. Then 

E a ^ 0, E h ^ 12. 

§2. Methods of solution in special cases. The generally recognized methods 
of solving normal equations become extremely laborious as the product (s 4- 1) 
(r " 1) grows large. As a matter of fact, the amount of computer's work is 
approximately proportional to the cube of the number of parameters to deter- 
mine. Therefore short cuts seem to be indispensable, A most elegant one is 
at our disposal in the special case^ when the values of x in the several groups 

* Tho remaining s + 1 paTametci's Or, br, - ■ • m are, of coursB, found from (3). 

’ This seems to be realized in Allen and Bowley's work. 
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are identical, or, at least, the sums m, Li x‘, ■ ■ ■ Li are identical 
in Instead of (1) we shall write 

j/i = Ai BiXi + . * • MX, ( 8 ) 

where Xi , X^ , • “ X, are orthogonal polynomials, i.c, such that, L XfX,- - 0 
if and only if i ^ j. In general, Xh = X* aj_i * aj , the coefficients 

being rational functions of «, L L * ’ ■ L 
The conditions (3) can now be replaced by a set of equivalent ones, viz. 

= Z5( = Ba--- Em, = (9) 

1 1 1 

How the actual values of j4b, J3o, * • ♦ Mo are found, will be shown in the next 
paragraph. The solution becomes now very easy, as the normal equations 
for the determination of each set of r — 1 parameters are independent, i.e. we 
can calculate the A's separately, then the B*s etcl, the order of solution being 
of no importance. Moreover the shape of the normal equations permits of 
considerable simplification of solution. Suppose we have to determine the 
values, of the coefficients K, corresponding to Xh. The normal equations are 
now — after certain simplifications — 

2Ki -h JCa + -f • • • ifr-i = •^-^2 “ ^rX/jJ/) + Ko 

JCi + 2K! + K,+ -" (El Xsy - E, x,y) + 


-^^3 "h -^3 -j- ■ » ■ 2i?r— 1 




(Er-.XtV - ErX«/) + K,. 


Adding these equations, dividing the sum by r and substracting the quotient 
from the j-th equation, we get 



1 / V 

rlv Exl 



( 11 ) 


The first member of the right hand side of (11) should be regarded as the 
principal term: this is actually tho value we would obtain for Kj, were this 
coefficient independent from the other K*s. The second member is a correction 
term, the necessary amount of correction being distributed equally among the 
several K^s. The simple solution given by (11) is only possible if the sum 
L Xl is the same for each grou^ From the definition of X^ we see that it 
is equivalent to saying that be identical in i. As 

k increases to s, wc pome to tho condition given at the beginning of this parsr 
graph. 
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§3. If this condition is not fulfilled, we can, indeed, replace the power series 
in a; by orthogonal polynomials the second subscript being appended 
in order to show that the values of the X polynomials are no more identical 
for the several groups; these polynomials are now orthogonalized separately 
within each group. But we are no more able to predetermine the values of 
jIo, jUo, ■ • ■ Jlfo, as they depend on each other; this will be made clear a little 
later. Therefore we have to resort to an approximation; the values of the 
parameters will not be found from Bimultaneous equations, but successively, 
step by step, beginiiing with those corresponding to the highest degree of the 
independent variable, 

The values of Oa , fto , • • • are given. It is evident that nh ~ The 
i-th normal equation is now: 

MtHiXU - + TL (12) 


We see at once that 


Mi=: 


M,j:,,xli ■vZ<X..^ 




Inserting this into /1 2/ we get 

Xfii/ 


r\ HiXrx'y 


Mi = 


■V _ 

1 1 


il/o 




i:* iiixu 

i 


(IS) 


(14) 


The second member of the right hand side of /14/ is again a correction term, 
the necessary amount of correction being distributed in inverse proportion to 
Now we determine the value of La, this coefficient corresponding 
to s — 1, the second highest degree of x, and calculate the several L’b from 
equations strictly analogous to (14) thus accomplishing the second step of our 
work, and so on, down to the j4's. Lq is found from the following equation: 

t. = j. - S [«:-!(<) • w. (15) 

i 

To is now appended a bracketed i, this to stress its variation irom group 
to group. We see from (15) that before the several are calculated we are 
not in a position to determine La . On the other hand, if is the same for 
all groups, the second member of the right hand aide of (15) simply reduces 
to and Lo can be determined in advance, i.e, before calculating the 

Af’s- This is the case treated first (in §2). In any case, if no definite corre- 
lation is to be expected between and Mi, the approximative method 

developed here should give very nearly correct results. The writer applied 
this method of solution to the simple problem of seasonal variation mentioned 
in §1 and found the results very satisfactory. 



A SIGNIFICANCE TEST FOR COMPONENT ANALYSIS 

By Paul G. Hoel 
1, Introduction 

During the last few years several papers and books have been written on 
various aspects of what has been teimed component or factor analysis. This 
analysis has arisen from the psychological problem of describing the results on a 
series of tests in terms of a few distinct abilities or components. In much of 
such work it is claimed that there does not exist more than a certain number 
of eomponentSj the material discarded in order to substantiate such a claim 
being considered as due to random errors of sampling or errors of measurement. 
However, mere inspection of results or the calculation of standard errors of 
residual correlations is hardly sufficient to justify ^uch conclusions, and there- 
fore a significance test of some kind is necessary. Hotelling^ considered such 
a test but based it upon an uncertain analogy with the analysis of variance 
and upon the legitimacy of using standard errors. The purpose of tliis paper 
is to derive a test which is more general in scope and in which all assumptions 
are explicitly stated. 

If each test score is thought of as being made up of two parts, a true score 
and an error element, the assumption that there exists fewer components than 
the number of tests implies that the scatter diagram of the true scores will lie 
in a apace of correspondingly smaller dimensionality. Consequently, an ideal 
test for the number of components would be one which would test the rank 
of the true moment matrix. In the case of normally distributed variables, 
this line of approach leads one to the sampling distribution of the generalized 
variance. Unfortunately, this distribution appears in unintegrated form; how- 
ever, by considering its moments it is possible to find a good approximation 
to this exact distribution for samples which are not too small. 

The paper proceeds by first finding two approximation distributions for the 
generalized variance, one for samples which are not too small and one for large 
samples. It then considers the type of population from which it will be assumed 
the sample was drawn, and finally applies the test to two numerical examples 
from recent literature along such lines. 

2, Approximation Distributions 

Suppose that N individuals have been drawn at random from an n variate 
normal population whose distribution is expressed by 

(1) P{xi, X2f •'* t ^n) = Ke 

^ Harold Hotelling, Analyeis of a Complex of Statiatioal Variablca into Prinoipal Com- 
' ponentfl. The Journal of Educational Psychology, September and October, 1933, pp. 21-25. 
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where .t,- = Xi — Wi, 4(,' = 


A* 


i; 


ffj L 


, A is the determinant | | and An is the 


cofactor of pn in A, and K " ( Ayy If the observed values of the 

variables of the o-th individual are denoted by = 1, 2, • ■ - , n), then the 

generalized sample variance is defined as 2 = 1 ayy 1 > where a,y = -^ S (X^ia ^ Xy) 

iV a-al 

(Xj'a — Xj). Wilks^ has shown that in sampling from the population (1), 
the fcth moment of the sampling distribution, of 2 is given by 

JN + 2fc - 

■'d — 2- 


J N + 2k- + 2 ^ - 2 ' 


M, = A 


-fc 


-•) 






■c^) 


where A = | ^,y j . An inspection of the integrated form of the distribution 

of z in the case of ft = 1 ai\d n = 2 suggests that there likely exists a function 
of similar form for higher values of n whose bth moment can be made to difier 
from M}t only in higher powers of terms which contain N~^ as a factor. An 
.investigation along such lines leads to the function 


( 2 ) 


whore C — 


y-n j 

2 2 
a n 


r^n 


JV^ft' 


g{z) = 

- ft - 2 , , (ft“l)(ft-2) 

,m = s ,a = A9and5==l-^ 1. 


2N 


It will be sliown that the Jfcth moment Mk of g{z) differs from Mk only in terms 
of magnitude less than the second and liiglier power’s of h^n/N or kn/N, 
Multiplying g{z) by / and integrating over the entire range of z will yield 
Mk , which turns out to be 


r(- 


X — ft + 




Upon reducing the upper gamma function and performing successive steps of 
simple algebra 


/ -k-nk / , — TT + 2/c 


Mk - ern 


?i 






N -n^ 


2b 


ft — 2/ft\/ 2b — ft — 4/ft 

w — + — w 




fi h_ ^ 2fc?i,/ft \ 

\ X J’ 


* S. S. lYilka, Certain Generalizations in the Analysis of Variaiien, Biometrika, Vol, 
XXIV. 1923, p. 477. 
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The terms in parentheses may be treated as the factored form of a polynomial 

of the nfcth degree in unity. Thus the quantities — ~ etc., may be 

treated as the zeros with signs changed of the corresponding polynomial in 
X (say). As a result, the successive terms after the first in the non-factored 
form of this polynomial in unity are the sums of the products of these quantities 
taken one at a time, two at a time, etc. Upon performing this multiplication 
and letting 0 = MjS assumes the form 

t 

where the neglected terms are in magnitude less than the second and higher 
powers of or kn/N, If Afjt is handled in exactly tlie samo manner, it 

will be found that 

M, = + f ~ ^ . 




.jiri — 2/5+3) 

V 1 ^ 


( 


N-\-2h- 


2 --')••• 


+ 


(‘-j) 

1 


2k — n-2\ /. n\ 

V+-i^)-V-Ti) 


where the neglected torms are of tlie same order of magnitude as those neglected 
in the approximation to Ml . Before a comparison of Mu and ML is possible, 
the factor q~^ of Mjt must be expanded and multiplied into the quantity in 
brackets. This operation yields the result 


Ml = 1 - 


iik{n — 2fc + 3) 

2F 


4* 






Thus Mk and Mi agree to within neglected terms. As a matter of fact, if 
the values of the neglected terms are" considered more carefully, it will be found 
that the actual difference between Mk and ikf* is considerably less than the 
given upper bound for the magnitude of neglected terms would indicate. Tor 
example, when n = 5 the first term in the difference is 6fc(/i; — .Q)N while 
625fc“iV^“ or is the upper bound for this term when only general results 

are used. The general formula for the first term in this difference has been 
obtained, but since the remaining terms have not been investigated and since 
the type of problems to which the distribution g{z) is to be applied docs not 
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justily this refinemDUt, it Tvill not be consideTed here. Conaequeiitly, if one 
considers this distribution function as sufficiently determined by its low order 
momenta and if one applies g{z) only to problems in which N is fairly large 
compared with then the function g{z) will give a good approximation to the 
exact sampling distribution of z. Obviously, g{z) is identical with the exact 
distribution for the known cases of w = 1 and « = 2. It is not possible under 
the above expansions to vary the constants in the form of g{z) in such a manner 
as to obtain an approximation whose A:th moment will agree with Mh to within 
still higher powers of comparable terms. 

In order to test whether or not a sample value z ^ Z can be reasonably 
assumed to have been obtained in random sampling from a population of type 
(1) with fixed A, it is necessary to calculate the probability P of obtaining in 
repeated samples a value of z greater than Z. Thus it is necessary to evalua te 


? = 1 - jf g{z) dz. 


jy ^ 

Upon making the substitution x — n\^az, ,&nd letting p = n — — 1 and 
u = = nN /^ 1 - l2ti(W - 


ft)] ^ this 


integral can be reduced to the standard form of the incomplete gamma function. 
Hence P assumes the form 


(3) P = 1 - I{u, p) 

where 

1 f uVp+1 

In many applications of this distribution it will be found that the values of 
u and p lie beyond the tabled^ values of these constants. Consequently, it 
will often be sufficient to use the normal distribution to which the gamma 
distribution tends as N becomes large. This normal distribution will be 
considered next, 

Rather than obtain a normal approximation to g(z) or the gamma function 
to which g{z) reduces after the above transformation, it is more illuminating 
to find the basic descriptive parameters of the exact distribution of 2 and from 
them obtain a normal approximation. Such a procedure will show how rapidly 
the distribution of z approaches normality with mcreasing N. By using the 
recurrence formula connecting iWjt+i and Mk , which can be found directly from 
the ratio of these two moments, and expressing the necessary moments in 


* K, Pearson, Tables of the Incomplete Gamma Function, Biometric Laborotory (1922), 
Univ, of London. 
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terms of M \ , it can be shown that these basic descriptive parameters are expres- 
sible in expanded form as follows: 


w = 0 j^l — 

a ,2 [2 


+ 1) n(n -f- l)(n - l)(3n + 2) 




+ 


24 


2n n(2n“ -7^+1) 




+ 




^1 = 


2(3n - 1) 


2r 


nN 






, (rt -I- 1)(57I - 3) 


2(3» - 1)N 


4(3n - 1)(4« - 1) 

T^T " I 


3niV 




These values suggest that 


+ 


h 


(4) 



H 


will likely be distributed approximately normally with zero mean and unit 
variance. As a matter of fact, by using the second limit theorem of probability/ 
it can be shown that the distribution of w approaches normality as N increases 
indej&nitely. Hence, for samples in which N is large compared with n^, it 
will be sufficient to compare the value of w arising from a sample z — Z with 
its variance of unity if a test of significance is desired. A better general ap- 
proximation could have been obtained by centering the curve at 

rather than at (^; however, since there is positive skewness anc 
lies between these two values, there might arise some exaggeration in a signifi- 
cance test in doing so because the accuracy of such a test depends upon the 
accuracy of the approximatioii in the right hand tail of the curve. 

Inspection of (3) and (4) shows that the only population parameter upon 
which these approximation distributions depend is There are no assump- 
tions necessary about the population means, or variances, or covariances, 
except in so far as they may be related when the value of is postulated. This 
means that either (3) or (4) enables one to test whether or not it is reasonable 
to assume that the sample variance z — Z arose in random sampling from some 
normal population with <l> equal to the postulated value, 


. n{n + 1)1 

2N J 

I the true mean 


3. Population Assumptions 


Consider the set of variables wi , U 2 , ■ • ■ , Wn distributed according to the 
normal law 

n 

“S Iff/ 

(5) JP(uij , tin) = Ki6 

* See, for example, Frechet and Sliohat, A Proof of the Generalized Second Limit 
Theorem in'tlio Theory of Probability, Trangactione of the American Mathematical So- 
ciety, Vol. 33, (1031), p, 633. 
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and the set of variables Uj , ^ 2 , • ‘ distributed according to the normal law 

n 

(6) P(wi , , • ■ ' , Un) — e ^ 

where the y's are uncorrelated with the m's and with each other. The joint 
distribution of the m's and ti’s is expressed by 

-2 *'t | “i “/-S «i 

( 7 ) ■■■ ,v,) = K,e ' ‘ . 

f 

Upon writing down the determinant of the coefficients of those 27i variables, 
it will become evident that any one of its principal minors of any order can be 
expressed as the product of a principal minor of [ bi,' ( with a principal minor of 
I c< I . Since the distributions (5) and (6) are normal, the determinants | 6,-,- 1 
and I Ci I are positive definite; consequently the determinant of the coefficients 
in (7) must also be positive definite. 

Now consider the orthogonal transformation 


Vi = 


Ui H- Vi 

V2 ' 


i=l,2. 


n 


Ui - Vi . , . . „ 

Vi “ — 7^ I 1 + 1, ■ • * , 2rt. 

V2 

Since the determinant of the coefficients in (7) is invariant under an orthogonal 
transformation, the resulting distribution of the if 5 may be expressed by 

2n 

-'L^aoivi 

w PiVhVi, ,y2n) = KiB ^ 

where | | is positive definite. 

In order to obtain the distribution of the variables ?/i , ?/ 2 , • • • , y.. , it is 
necessary to integrate (8) with respect to the variables ijn+i , ■ ■ • , ?/ 2 r» over 
their range of values. If this integration is performed after the quadratic form 
in the exponent of (8) has been expressed as a sura of squares^ with coefficients 
which are the ratios of principal minors of | da \ , it will be clear that the inte- 
gration leaves a quadratic form in the exponent which is also positive definite. 
Hence after the transformation Xi = -s/^yxii = 1, 2, > - ■ , n) the distribution, 
function of the variables Xi - lii + u,(i = 1, 2, * ■ • , n) must be normal and 
may be expressed by (1). Thus it has been shown that if the true parts m 
of the variables a;,- are normally distributed without error and if the error parts 
arc normally distributed but are uncorrelated with the Ui and with each 
other, then the variables Xi possess a normal distribution, The advantage of 


* Seo, for examplCj Risacr andTraynard, Lea Prineipes dfi la Statiatiquo Mathematique, 
1933, p. 226. ' 
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tins formulation will become evident when the parameter is expressed in 
terms of the parameters of (5) and (6). 

Since the v’s are uncorrelated with the w'a and with each other, the variance 
ffi of Xi is the sum of the variances of w,- and Vi , while the correlation pn be- 
tween Xi and Xj may be expressed in terms of the correlation p'a between Uf 
and U] and the variances Ui, v* of n,-, Uj, Vi, t»,- respectively. These 

relationships are 


(9) = m 2 + , and p,-,- = 


/ 

Pii 


'V^(i + 4 /m2)(i + 


(“i 5^ j). 


For simplicity of notation let X,* = . Now it is well known that ^ can 

be expressed in the form 


4 2 2 

0 = ffi tra ■ ‘ * (r„ p*/ 


If the values from (9) are inserted in | pn | and if the resulting denominators 
of elements are factored out, 0 will assume the form 

Q a n 

ffiffi • • • 


(j> = 


where 


B = 


(1 -|- Xl) • • * (1 *1- Xn) 


1 + ^1 Pl2 ■ ■ ■ Pin 
/ 

Pia 


/ 

Pin 


1 + X„ 


Following the methods of confluence analysis,’ B can be expressed as follows.* 


n n 

5 = 2E “h “h XaX^B)ap( * "b X 1 X 2 ' ■ ’ Xfi 

a=»l a<^ 

where R = | pv/ 1, is the principal minor of E obtained by deleting row 
and column «, etc. R is the true correlation determinant whose rank it is the 
object of this paper to test. If R is assumed to be of rank n — t, then all 
principal minors containing more than n ~ i rows vanish and B reduces to 

n 

B y 1 Xflj ■ ’ * XiK j ' • a (( *b ' ' ' “b X 1 X 2 ’ ■ ' Xn . 

The tests (3) and (4) were designed to test hypothetical values of ^ by means 
of the sample Z. Evidently the value of can be postulated by assigning 
hypothetical values to the X’s, the <r's, and the principal minors of J2, 
Assigning values to the X's does not curtail the degrees of freedom in these 


» S. S, WilkB, loo. oit., p. 477. 

^ Ragnar Friech, Statistical Confluence Analysia by Means of Complete Regresaion 
Systems, Oslo, 1934. 
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testa because they "weTe derived on the basis of (1) ■which depends only on the 
m's, a-% and p's. The X's do restrict the range of the p^s, but not their degrees 
of freedom. 

An inspection of the expression for ^ shows that 0 can be made to assume 
any desired value irregardVess of the rank of R by merely assigning the o's 
properly. It is therefore necessary to make some assumption regarding the 
<r's if the test is to serve the purpose for "which it is intended. Here it will be 
sufficient to assume that the product of the population variances may be re- 
placed by the product of the sample variances. This assumption will ordinarily 
be approximately, fulfilled for the size samples for which it is legitimate to 
employ (3) or (4) ; consequently this assumption does not restrict the range of 
application of the test. 

To postulate values of the principal minors of ii beyond postulating the rank 
of 72 would introduce hypotheses and restrictions which are irrelevant to the 
fundamental purpose of the test. This difficulty will be avoided by replacing 
all non-vanishing minors of R by their upper bounds of unity. Since this 
will overestimate the value of B, and hence of 0, the usual significance level of 
.06 may be considered as decisive. Let the value of B when unity is inserted 
for all non-vanishing principal minors be denoted by D, Then 

n 

(10) D = 2 * * ' ^a| d” ' ' * ‘ ' Xrt . 

Sincx- 

n ' n ' It 

XI (1 ■4' X<) “ 1 4" ^Xa 4" Xj Xnj Xoj 4“ * ’ ‘ 4* XlXl • * • Xn 

1 «I<ai 

it will often be convenient to ■write D in the form 

(11) I) = II (1 4^ X<) “ /l 4" 2Xa 4" ■ ' V 4“ XaiXflj • • * Xa,«, 

As a consequence of all the above assumptions, 



( 12 ) 


? — f I _ (l 4~ Xl) • • ' (1 4~ Xn) I Tjf 
^ ~ <p ~ B 

>;> (1 4“ Xl) * * * (1 4“ Xn) 1 Tij I 
D 


where | rn \ is the sample correlation determinant. 

All the essential material for testing the rank of the true correlation matrix 
is contained in. (3), (4), (11), and (12). In summary, the hypothesis to be tested 
and the procedure to follow in performing the test are as follows. 

The population of n variables from which the sample is supposed drawn is 
assumed to be such that (a) the true parts of the variables are normally dis- 
tributed, (b) the error parts are normally distributed but are uncorrelated 
with the true parts and with each other, (o) the product of the variances may 
be replaced by the product of the sample variances, (d) the values of the X^a 


I 
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are postulated as judged by the accuracy in measurement of the variables, and 
(e) the rank of the true correlation matrix is w — 

Given, the value | m \ of the sample correlation determinant, a lower bound 
for the value of Z/(f> is calculated from (11) and (12), This lower bound is 
inaerted in either (3) or (4), depending on the size of the sample. If (3) is 
used and if P ^ .05, or if (4) is used and w ^ 2, one may conclude, as judged 
by the sample variance, that it is very unlikely that the sample was drawn in 
random sampling from the population specified above. If one has reason to 
believe that the variables are sensibly normal as indicated above and that the 
postulated values of the X’s are quite accurate, then the test shows quite defi- 
nitely that the postulated rank of the true correlation matrix js unsubstantiated 
by the sample, and therefore a higher rank should be tested until a non-signifi- 
cant value is obtained. Because a lower bound rather than the value of 
is used, the test can be used on minimum ranks only, and hence a value of 
Z < 4t will not yield a test of significance. However, the test does handle the 
problem for which it was designed and which is of fundamental interest, and 
that is to see whether or not one is justified in assuming that a sample repre- 
sents only a certain minimum number of components. 


4. Applications 

(a) Hotelling^ has used an example taken from other sources to illustrate 
his test on. components. In order to compare results, this same example will 
be treated here under the assumptions outlined above. In this example the 
reliability coefficients are given. ITrom the definition of a reliability coefficient 

Tif it follows at once that r< = - — The population values of the Ws will 

1 “p A|‘ 

be set equal to the, values obtained from these sample reliability coefficients. 
The data for this problem are 


?-f/| = .236, N = 140, 71 = 4, Xi ~ .087, Xs =* .119, h 


.101, X4 = .773. 


Assume that the true correlation matrix in the population is of rank two, that 
is, that two components are sufficient to describe the results on these tests. 
Since N is large compared with ??.*, it will be sufficient to use (4). The values 
of (11), (12), and (4) axe found to be 




n (1 -f- X() I Tij I _ 1 
D 


90 


^ [1.90 - 1] - 3.76 

o 


‘ Loc, cit., p. 16 , 
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Since the standard deviation of w is unity, this value demonstrates clearly 
that the hypothesis of only two components is untenable as judged by the 
sample correlation determinant. If one assumes throe components, the test 
will be found to yield a non-significant value. Hence it may be concluded that 
under the hypotheses on which the test is based, the sample does not justify, 
the assumption of less than three components. Hotelling’s test indicated the 
necessity for two components but was uncertain about the third, the decision 
resting upon a variate value of 1,31 as against a standard deviation of unity. 

(b) Thurstone, in his Vectors of Mind,’' considers an example taken from a 
series of fifteen psychological tests. After applying his centroid method to the 
data, he inspects his results and concludes that four components are sufficient 
to account for everything except random errors. It is impossible to test his 
conclusions explicitly as above because the size of the sample is not given and 
the reliability coefficients are not known. Nevertheless, if it is legitimate to 
assume that the sample is sufficiently large to justify the use of this test, in- 
teresting conclusions can be obtained on the assumption that only four com- 
ponents are needed. 

Suppose that Xi = which implies that the variance of error is half as large 
as the true sampling variance for each variable. Here (10) is more convenient 
than (11) for computing the value of D. The values of (10) and (12) are 
found to be 

D = -h iMiY' + + (^y^ - .125 
? > hsiL 

4> ^ .0003* 

Evidently, the value of | r,-,- [ must lie in the neighborhood of .0003 if the test 
is not to yield a significant result which contradicts the hypothesi.s. However, 
the correlations in j Th [ are given to only three decimal places, and therefore 
a legitimate value in the neighborhood of .0003 can not be realized. It is to be 
noted that the postulated values of the X's are equivalent to postulating that 
all reliability coefficients are equal to f, a value which should be considered as 
unusually low, It would seem reasonable to avoid using material in which the 
variance of error is larger than one-half the variance of random sampling, unless 
the variance of random sampling is exceedingly small . 



CONTRIBUTIONS TO THE THEORY OF COMPARATIVE STATISTICAL 
ANALYSIS. I. FUNDAMENTAL THEOREMS OF 
COMPARATIVE ANALYSIS' 

By William G. Madotv 

This is the first of several papers in which there will be presented a general 
approach to the statistical examination of hypotheses which arc false if any of 
several things are true. Phenomena requiring such a gtatlstical theory are 
investigated quite frequently. As examples may be cited the studies of lag 
correlation in time series, jicriodogram analysis in geophysics, factor analysis 
in psychology, and analysis into components in agriculture.'* 

The theorems of this paper have one purpose: to permit the reduction of the 
distributions by which the hypotheses are to be tested to essentially the joint 
distribution of the statistics which contain the infoimalion offered by the data 
concerning the truth or falsity of the things which will negate the hyi)otheses. 
In order to do this it has been necessary to goneralijic the theorem of Poincare 
on the probability that at least one of several events occur, ^ As illustrations 
there are stated, after Theorems III, VI, and IX, goneralization.s of a distribu- 
tion derived by Jordan, (5) page 109/ 

In a second paper, wc shall give a complete derivation of the joint distribu- 
tions necessary for the applications of the analysis of variance. A reconsidera- 
tion of the Schuster periodogram will be included. In other papers these 
results will be extended to problems arising in the tlrcory of regres-sion, and to 
problems of. the distributions of medians, etc. 

The fundamental theorems of comparative analysis arc now obtained in such 
a form that they are applicable to problems in the theory of probability no 
matter what the distributions may be. Some special eases of these theorems^ 


1 Presented to the American Mathematical Society, March 27, 1037. ReBcarch under a 
grnnt-in-aid from the Carnegie Corporation of New York. 

^ Naturally these techniques are also useful in other branches of science then those in 
which they were first applied. It should be noted that by analysis into components we 
here refer to the work of Fisher, (2), chapter 0. 

® See, Pomcar6, (7), page 60, This theorem is attributed to Poincarfi by Jordan, (5), 
andFrfichet, (3). 

* This distribution states the probability that in r trials of nn experiment which has 
exactly n possible results, these results being mutually exclusive, each of the possible 
results occurs at least once. Jordan^s derivation iins been simplified by Frfichet, (3), 
page 12. 

* The theorems are, of course, part of the theory of measure and integration. 
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have been used in connection with the derivation of distributions of positional 
statistics such ns the in order of iV elements,® and others. 

Let S2 be a collection of elements a;, and let A be a set of subsets of il. Then, 
the axioms which the elements of A are to satisfy are^ 

I. A is a field;® 
n. S2< A; 

III. To every id € A there is ordered a non-negative real number P(4); 

IV. P(fi) = 1; 

V. If A 6 A and B e A, and AB = 0, then P{A -\- B) = P(A) + P{B). 
We shall regard as the set of possible results of an experiment e. By events 
we shall mean elements of A. The complement A of A with respect to will 
be an element of A if A is an element of A. A consists of all elements of 0 
which are not elements of A and hence is the event which occurs if and only 
if A does not occur.® 

Let the subsets of 12 

( 1 ) El f Ei , ' • ' , Ek 

be elements of A. Then, if ai , oa > • * * , a* is a permutation of 1, 2, • ■ • , fc, 
the set 

(2) Ea^Eft^ ■ • • EafEt,j+i ‘ ■ ■ Eaf, 

is an element of A and is the event which occurs whenever all the events 
Eai , Bai , ' * ‘ , E„f occur, while none of the events , Eaf+t , - ■ ■ , 
occur. 

The events (1) are said to be independent if and only if 



for all selections of the sets (1) and their complements/® 

Theorem L The 'prohabiUiy that the first j of ike k events (1) occur, while ike 
remaining h — j events do nol occur, is 


* See, for example, Gumbel, (4). It is noted that Theorems 1, II, and III are stated by 
Arne Fisher, (1), page 42, who aasuracB, however, that the events are independedt. 

T These axioms are stated by Kolmogoroff, (6), page 2. 

* A set of seta is a field if the fact that A and B arc elements of the set implies that 
A + -0, AB, and A — AB are also elements of the set. 

* The event A will be said to have occurred if the result o^ the performance of the experi- 
ment E is an element of A. 

See Kolmogoroff, (0), page D for a discussion of various equivalent definitions of 
independence. 
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(4) * ■ * -Ei:) = 2! (“l)*^ S P{^\ ■ * ' ‘ • ■ 

P'”0 ffi,' 

fti<“a<' *-<^7 

Proof, Let fc = j + L Then it follows from Axiom V that 


(5) ■ ■ ‘ ^f) — P(ifi^2 ■ ■ * JEf]Sj^.i) P(^EiEz • • • 

Hence the theorem is true for fc = j + 1 and any j > 0. Let tlie theorem be 
true for lb = j, j -f Ij ■ . ■ , fc — 1. ^ From Axiom V it follows that 

(6) P(A?i . . ■ E}JEi.y, . . . Jfc) 

= P(Ei • ■ • EjEjJ^.l • • ’ Ek-i) — P{Ei ■ • • EjEj^i • • • Ek-.iEk)> 

Substituting from (4) the theorem is proved. 

Let «>«•!+ ■ • • + Tij , nj > 0 (i = 1, • • ■ , 0 1 


n\ 

nj ■ nd (n — n-i — ' - ■ — «.<) I 

Corollary, If, for each value of r, (v = 
terms 


= (?i;ni,ns, * • • , n,). 

1, 2, • - - , fc - j), the (k 



P(Ei • • • EfEai • • * Ea,) 

which can be obtained by selecting ai , aj , ■ ♦ • , a, without repetition from 
J + L j + 2, • ■ • , fc, are all equal, then 


(7) P(Ei ' • • E,Ef+i ■ • • JSi.) = E (-!)'(*: “ jj 

Let ^ 

(8) Sfy) = E P(S.tK, ■ ■ ■ EJ 


where the summation extends over the (fc; i*) terms 


(9) P{EaiE«2 ■ ■ * -^Op) 

which can bo obtained by selecting v of the k events (1) without repetition. 
If all the terms (0) which can be obtained by selecting v of the k events (1) 
without repetition are equal, then 

(10) S{v) = (lb; ;^)P(^i . ■ ■ EX 


“ By definition 

E P(£.-..£,B,+l 

v =0 Ofi,'>',ar“#+I 

«!<■ ■ •<ap 

k-i k 

= P{.Ei ‘ ' * Ej) + ("ly PiEi f • ' EjEcti ‘ ‘ fi'rtp)* 

y— 1 ai,' > 
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Theorem II, The 'probability ihai exactly j of the k events (1) occur is 

(11) P« = £(-i)’(i + -';W + >')- 

(--O 

Proof. If il(,) ,is the subset of Q defined by the requirement that exactly j 
of the events (1) occur, then A^) is the sum of (fc; j) disjunct sets: 

ifc 

(12) A{i) = £ Pai •" • • ■ -^0*3 

ai, 

where dj+i , • • ■ , «<; have those of the values Ij • ■ - , ft which remain after the 
selection of ai > • • • , a,- . By Axiom V we may replace A by P in (12), Upon 
substituting from (4) we note that the resulting terms of (12) which depend on 
tlie same number v, r = j, • ■ ■ , ft, of events have the same sign, that all ;S(v), 
J/ = ^ k, occur, that no term depending on fewer than j events occurs, 

and that any particular P(jSaiPai * ■ ' -^ 1 *;+,) will occur in those of the terms 
of (12) the j ocDurrijig events of which are a subset of 2^a, , Eaj , • ’ • , 
and will occur in no other term of (12). Hence the coefficient of S{j + t) in 
(11) is ("1)* U + 1} 0* This completes the proof of the theorem. 

ConoLLARY. If (10) is true for r = j, > - - ,k, then 

(13) P(fl = 2 (- l)’(t;i, ■■) P(E.E, ■ ■ ■ EiJ. 

F-O 

Theorem III. The prohabiliiy that at least j of the ft events (1) occur is 

(14) P“ = 2 (-l)'(i + >• - 1; >■) S(3 + >■)• 

Proof. If A^^^ is the subset of Q defined by the requirement that at least j 
of the events (1) occur, then A^^^ is the sum of ft — j -H 1 disjunct sets: 

(16) = A(/) + A(,-+i) + " ' + A{ic) . 

By Axiom V we may replace A by P in (15), Substituting from (11) . 

(16) P“ = 2c.S(j + .), 

i -=0 

where 

c.- = (; + f'i i + v) - (i + v; 1) H — + (- 1)^^ + >'1 1'), (j' = 0, ‘ , ft - j). 

It is easy to prove that 

(17) (-i)’(3 + f - 1; .•) = 2 (-i)'-'(i + + (-). 

p-0 

Corollary. If (10) is true for = j, ■ - • , ft, then 

(18) P'O = 2 (-l)'(j + y - 1; v)(fc; j + y)P(.E.E. ■ ■ ■ Ei+.). 

v^O 
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To provide examples illustrating these theorems let us consider r experiments 

(19) 

Let have fc mutually exclusive outcomes 

(20) o!”, oS”, . . . , 

Then, it is easy to define the spaces fl ”, A*” the probability function 
the combinatory product 

Q = X X • • • X 

the set A and the probability function P(E) bo that Axioms I, • • ‘ , V are satis- 
fied and hence Theorems I, II, and III are valid. 

We shall assume that the experiments (19) are independent. 

Let 

0/ (j ss , k) 

be the event which occurs when neither Of* nor Of* nor • • • nor Of* occur. 
Then 0/ occurs if upon performance of the experiments (19) at least one of 
Oj‘\ Of*, ... , of* occur. 

It is an immediate result of the definition of independence that 

(21) P(0., (5.. • • • 5„) = n 1 1 - P(0‘‘’)| . 




From Theorem I, the probability that Oi , O 2 , • • • , Oy each occur while not 
one of Oy+i , Oy+ 2 , • ♦ • , Ofc occurs is 


P(0i 


( 22 ) 


1 • ■ • OyOy+i • ■ • Ojfc) “ ^ ( — 1)’ ^ 

r-O 




n [1 - p(0j‘A) p(oi'') - P(oi\') P(oLV)). 


From Theorem II, the probability that exactly j of Oi , Os , - ■ • , Or occur is 
(23) P(n = t(- mk - j + v; ,)S(k - i + v), 


*—0 


where 

S(ji-y + ,) = 


E n {1-^(0'.?) f’(0S,t,J)- . 

Since the probability that at least j of 0i , Os , • ■ • , 0^ occur is equal to 1 
minus the probability that at least fc — j -|- 1 of Oi , Oj , ■ . ■ , (5* occur, it 
follows at once from Theorem III that 


P{at least j of Oi, • * < , 0* occur) — 


(24) 


1 - ^ (-!)'(*! - j + P-,>)S{k -j + p-\- .1). 


There arej of couTse, other ways of computing these probabilities, 
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The case treated by Frdch^t and Jordan is that which occurs when we assume 
P(OS''^) = P(Oj'^), (i = 1, • • ’ , fc), {», A = 1, • • • , r) and in (24) let j « L 
It is not difficult to obtain further generaliaations of Jordan’s distribution, by 
defining events which occur if and only if fewer than f of r events occur and 
then proceeding as above. 

Certain useful generalizations of Theorems I, II, and III will now be derived. 
Let the subsete of SI 

(25) • ' • ) ; p) 

be elements of A, and let N = 4- -f * * ■ + 

Let (s = 1, •■•,?); and let 

(28) Q'‘’ = niI4‘’ (( = !,••■,?), 

a-1 f*-l 




(i — 1, • • ■ ,p). 


Furthermore, let for each value of «, (® — A, • ■ • , p), the (k^'^ — 
possible distinct selections of of the k^*^ — sets 

( 28 ) , Pji**) 


be arranged in some order, and, if the intersection of the sets of the 
selection be denoted by 


(29) 

let 

(30) 




(s — A, • ■ • , p), 

(t. = 1, 2, . . . , - /'>; .">)), 


i-fr 


There are f[ (fc^*^ — p^'^) sets (30), for each value of A, (A = 1, • ■ • , p), 

»~A 

and any set of fixed values of • , y^‘*\ 

Let for each value of s, (s A, ■ . • , p) the (k^*^; possible distinct selec- 
tions of of the sets 


(31) = 1, ‘ , k^'), 

be arranged in some order, and if the intersection of the seta of the selection 
be denoted by 


(32) 4‘*(/‘>) 

let 
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There are sets (33), for each value of {h = 1, ■ ■ ■ j p), and any 

a=A 

set of fixed values of • • ■ , 

It is clear that the various sets that have been defined are elements of A. 
The fact that, the sets are the events which occur if and only if certain sets of 
events occur is also too obvious to require further comment. 

Theorem IV> The prohahility that of the N events (25) the first of super- 
script s occur and the remaining of superscript s do not occur, s ~ 1, > • • , p, is 


(34) 


P(q(p)q(p)') ^ ^ 

vClJmO 


••• E (’-I) 

}»IP)h30 

(fctO— 




(l-l 


/tP>;»tP)) 




Proof. Theorem I is a proof of Theorem IV for p = 1. The theorem may 
then be proved either by regarding it as a special case of Theorem I and col- 
lecting terms, or by induction. 

Corollary. If, for each possible set of values of • > • , the 


ft 


terms 


(36) 

are all equal, then 

P[g’‘ 


... 


p(Q(r)Q<p)') = 


(36) 




(-1) 


|f( 1 . .+v<p) 


y(p)»0 




a»L 


Let, lor each value of A, (ft = 1, ■ • • , p), 

S(v“>, , <■'-') 

(37) 


:= E E 


• /'>)]. 


ipM 


It is apparent that by using (34) it is possible to obtain an expression for (37) 
which' does not depend explicitly on In fact 


r ... 2 (_i) 


5(v^ ■ • . , - E E I 




E ••• S E ••• E 

< 1-1 < A “1 


. . . , , ir'"*)]. 


(38) 


• • f 
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If the different terms of (37) are all equal, then 

t^h 


If the different terms of (38) are all equal, then 


, /'>) = ■' 2 ... 2 (- !)'“'+• • 

^ O )hO yCfc— 1 )hQ 

1 

(40) n(i“ -/*>;.<•>) ft 

V^ ' • • , 4‘'‘’ V"*’, ' * • j 

Theorem V. The probability that of the N events (26) ike first of superscript 
s occur and the remaining do not occur, (5 = 1, ■ • * , A — 1), and exactly 
events of superscripl 8 occur (s = li, . • * , p), fa 

kw-f<h) 




(41) 


i*, c'*-"’) = E ••• E (-1) 

,WmQ pf^O 


■•A 


Proof. The theorem may be proved, either by induction using Theorem II, 
or by obtaining disjunct sets as in Thieorem II and using Theorem IV. 

ConouLAEY I. If (39) is true for all sets of possible values of ■ . . , 
then 


(42) 


P(, «)..., (,n(«*'''’(3‘*'"') = 





it(p)-/(p) 




(-1)' 


<A)+...+r<p) 


t^h 



Corollary II. If (40) is true for all sets of possible values of • > • , 

then 


fc(i)-y(i) 




P(i«).wwj(0''"* «“■”')•= E E (-1)’“’+- 

»i(T5—0 v(pl — 0 


+ip(p) 


(43) 


i^i 


) ft (fc“’ 




Theorem VJ, The probability that of the N events (2B) the Jirsi eveids of 
superscript s occur and the remaining do not occur, a — I, « ■ • , £jf — 1, exactly 
events of superscript 3 occur (s = g, ,h ~ 1), and at least events of 
superscript s occur (a = A, • < ■ , pj'fs 
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f ... r 






;fp) 1 ..<93 ^ . .j{p3 

/ 

Proof, The theorem may be proved either by induction using Theorem III 
or by obtaining disjunct sets as in Theorem III and using Theorem V. 
Corollary I. If (39) is true for all sets of possible values of 

,.(p> 


JQ) „(fl+l) 

> j 


V V" ' ", 


then 


ifc(p)— /(p) 

_ ... E 

p(0)-iO v(p>*0 


PljU!:::?!!-’..,©''-’’®''-"') = E 

n v“) fl [(/■> + + /•>)l 


J-C 




Corollary II. If (40) is true for all sets of possible values of • • • , v 

then 


(p) 


fcdj^d) 

,E. ••• (-» 


V (I 1+ - * ■ -j-p Ip) 


iP<T7n0 




5-0 


Let U8 again consider the experiments (19), and let us assume that 
E^'\ {i = 1, ■ • • , r) has as its mutually exclusive results 

(47) Oi!' (i= 1, ...,fc'‘’)i(s= 1,2), 

Let Of, be the event which occurs if, upon performance of the experiments 
(19) at least one of the events Oj?, ■ • ■ , 0*? occur, and let 6(, be the 
event which occurs if and only jf Of, does not occur. 

We may state the probability that the event Ei , which occurs if and only if 
at least of the events On , (f = 1, • ’ • , occur, and the event Ei , which 
occurs if and only if at least of the events 0^ , (i ^ 1, ■ - • , occur, both 
occur. 

It is apparent that 

(48) P(EiE2) = 1 - P(Ei) - P(A) + P(EiEi), 

where Si is the event which occurs if and only if E, does not occur, (s = 1, 2), 
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From Theorem III 

PiS.) = E (-!)’'■’(*!'’’ - /■’ + ~ + /■’ + 1) 

(49) »(o-.o 

(«= 1 , 2 ), 

where 

fc(t) 

s"(it'*’~/‘’+>“ + i) = E 

<»li' ■ 

n {1 ^ — ■ • ' — = 1; 2). 

f-1 

From Theorem VI 

.. Pirn ^ 'e ‘ ' e' (- 1)'“’^'“’ n (t'-' - + /“ ; ^“) 

(50) ,(1M k{^0 1-1 

^ ^ _j. _f, 1)^ 

where 

(jfed )j^ (!)_„( I )^I) {J|;(l>|,<2)_„<g)^l) 

S(fc'« - j«> 4 - /■> + 1, fcO' - j<» + + 1 ) - E E 

P[i‘”’(A'” - /“ + K® + 1, ft* - /® + r* + 1)1, 
&nd 


P[g‘">(fc'‘’ - j“ + ■,'« + 1, ft* - i'« + + 1)1 = 

El- E P(o'A)- E i’CoSlJlk 

{-1 I, I'-i ji-i J 

the subscripts a, , (v = 1, • ■ . , 4- 1), being those oi the ^ 2 *** 

selection of + 1 events from events, and the substripts 

jSp , {fi ^ 1, ••• + 1), being those of the selection of 

^( 2 ) _ j{V ^ j^( 2 ) ^ 2 events from events. 

The desired probability is then obtained by substituting from (49) and (60) 
into (48). The procedure is perfectly general, and applies directly to situations 
in which p > 2. 

We shall now investigate the results obtained by requiring that the events 
considered satisfy a relation of implication. 

Let the subsets of fl 


(51) 


(s — Ij > ♦ . 

.P)» 

be elements of i, end let 




(52) 

Ei, C Eu , 

(i = 1, • • ■ 


if « < (. 
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It follows that 


(53) P{Bi.E<,) = P{E„), (t = 1, . . . . k), (s < t). 

Lstji < jt and let 

(54) 


A_ }* 


Let ji < ji < ‘ * < jt and let 


Q< = II II-Bi,, 


(55) 


<31 = n n E,., 


{1 = 1,2, ■■ .,p). 


(/= 1,2, 


(56) 


From (52) and (53), it follows that 

p(o.e;) = p(ril ft ft.1 

\L^“l J 


rti w 1 Jt \ 

n ft A. n sA 


(jo — O) (^ " Ij 2, ’ ■ ■ , p). 


Ijet Ji < J2 < ■ ■ ■ < jp and for each value of s, (s = 1, • ■ • i p)i consider a 
selection of 4“ events of second subscript s from (51). Let the p selections 
thus obtained be such that 

J* + Vi < ji+i , (s - 1, 2, . • • , p), (ip+i - h), 


and if Ei, is one of the events of the selection of events of second subscript s 
then the fact that t > s implies that Eu is one of the events of the selection of 
events of second subscript i. 

From (62) and (53)^ the probability of the occurrence of all the events of the 
p selections thus obtained is a function of jp + Vp events, a, of which are of 
second subscript s, (e = Ij ■ • • > p) where 

(57) Ml + P2 + • • ' + = i* + r», (s = 1, ' • • j p), 

and for a given set of values of ji , ja , * • ■ , jp the m« and v, determine one another 
uniquely, (s = 1, ■ ■ • , p). 

For a definite set of values of ■ , jp and m fip or ji ,•••, jp and 
vj , > • • , yp there will be 

(i<+i - j*; V,) = - jt’, jt+i - Hi - ... - Ht), (s = 1, ■ • • ; p), (i,>+i = k) 

possible distinct selections of + r, , (s “ 1, • • • , p) events of second sub- 
script 5, j, of which are preassigned, from j,+i events, (s = 1, ■ • • , p). 

Let these selections be arranged in some order for each value ofs,® — 1, ,p, 

and let 

(58) 5i,ij ... f„(^j , Hi , • - • , Hp) 

be the event which occurs when for all values of s, (s — 1, • • • , p), the events 
of the selection of H- v, events of second subscript a all occur.” 

It is understood that the j, preaesigned events of second subscript s are among the jj 
preaesigned events of second subscript I, (l > $) in the events (58), 
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A typical event (58) is 
(59) 5i--i(/Ji> 


ii+i", 


' * I /^p) ^ n n j 

1=1 


(jQ + — 0), 


There will be, for a definite events of second subscript s, {s - 1, ‘ , p) 

(60) li (ji+i - v»), ' (jp-j-i = h), 


events such as (58). 

For a definite set of values of /xi i • * ' » Mp there will be, for each value of s, 
(s " , 7?) 

(fc - • - Mi ; M.)i (« = 2, - . . , p) 

possible distinct selectionB of j, + v, events of second subscript a, 
of which are preassigned from k events, (s * 1, • • • , p). 

Let these selections be arranged in some order for each value of s, 

(s - 1, • • • , p), 

and let 

( 61 ) , i>(mi ) /^z > ■ ■ ■ } Ml.) 

be the event which occurs if and only if, for all values of s the events of the 
set of j, + vt events of second subscript s all occur, (s = 1, * • ■ , p), and 
the first subscripts of the events of the f,*** set of events of second subscript s 
are among the first subscripts of the events of all the selections of events of 
second subscript greater than s, (s =* 1, * ■ ■ , p), 

There will b6 

(62) (fc; Ml , M2 F * “ j Mp) ' 


events (61) which may thus be obtained. 

Theorem VII. The probability that of the pK eo&nis (51) the first j, euenfs of 
second subscript s occur and the remaining k — jt events do not occur, s = 1, • • • , p, 
is 


(63) 


fs~h t -h 

p(.QM- L E • I: (-ir'^'-+- •• 


.p-fl 


^ ••<p(m1i M2i ■ ■ ‘ f Mp)1f 


where the event Qi determines the j, — j,_i — v,_^ events of second subscript 
®, (s =* 1, t . • , p), which have as first subscripts all numbers 1, 2, ■ ■ • , j, which 
are not among the j,_i + numbers determined by the events of lower second 
subscript than s which are contained in ... (mi , ■ ■ • , pp). 

Proof. Expand (66) by means of Theorem IV. 
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Corollary. If, for each fixed set of values of /ii , ^ 2 , • . ■ , jUp the terms (58), 
in number (60), are all equal, then 

,,, p(o,q;) = '2‘ "i'- (-1)-—. n (i.« - y. .) 

^I?l-..l(^Xl, #12, ■ ’ • , #lp)] (jp+l = fc)- 

Let 

<^Mi) (t-pi— 

r(#llj #12, • ■ * , #tj>) ' • * ■ S 

(65) ii-i (p«i 

f*2, * • • , #ip)l' 

If all the terms of (66) are equal, then 

(66) Tifiit * ' • , #ip) = (h; /n, jU2, • • • , /ip)P[?i...i0n, • ■ ‘ , #«p)]. 

\ 

Theorem VIII. The probability that of the pK events (51) exactly j, events of, 
second subscript s, s — 1, . • - , p occur, is 

Pbv;.)= E E ••• E 


( 67 ) 




ft (/i.iJt - Ml “ * ■ ■ - M*-i) Ma» ' • • , Mp)* 


Proof. If .4{fi, /j) is the subset of Q determined by the requirement 

that exactly j, of the events (61) occur (s = 1, • • • , p), then A(,', is the 
sum of 

(^I ii ) ia — Ji , ia *“ is , ' • • , jp — Jp-i) 

disjunct sets which may be obtained by replacing P by A in (56) and forming 
(66) for all selections of j« — j,-i occurring events from k - Jj_i events, 
(s = 1, • - ' , p). By Axi^m V, , ,,) is the sum of the probabilities of 

these disjunct sets. 

Substituting from (63), it is noted that all terms (61) which depend on the 
same m« , (^ = 1, * • ■ > p), have the same sign and that all 7 '(mi i j * ■ ■ Mp) 
for which 

0 ^ j'* ^ ji+i jt , (s = 1, • • ■ , p), 

appear and only those appear. Furthermore any particular term (61) will 
occur in those of the terms (63) the j, — j,-i occurring events of second sub' 
script s, (fi = 1, • • • , p), of which contain a fixed Fh-i events, the remaining 
j, — j,_i — V .-1 events being a subset of the Mi events of second subscript s, 
(s — 1, ' • • , p), that actually appear in the particular term (63). Hence the 
coefl&cient of Tita , ■ • ■ ^ M;,) is 

J"1 


(mo = 0). 
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ConoLLARY, If (66) is true for all sets of possible values of ;ii , jua , • • ■ , 
then 

p» W-'S'S 

^i"0 yp"«0 

/*qN 

(fejjif ~ ji ' }jp ~ jp-i ^ 

Maj ' ‘•jMp)J. 

Theorem IX. The probability that of the pk events (51) at least ja , but not more 
than iffi-i , events of second subscript s occur j (s *= li • ■ ■ , 0f)> exactly j, events 
of second svbscaripi s occur, (s — p 4* 1| • • ■ r P) 

(69) f'uiVu-’-*) = S XI 2 ^2i * */ } Oj 

fla“0 

where, if a 1 in the position is denoted by 5,- , (t = 2, ■ * ■ , g), 

^Uq-h ?p)(f» fill • ■ ■ j » Oj • ‘ ^ 0, fitj+i, ’ * • ^ ^ti; 9, ■ ■ • j 0, • • • , fiy^+i, ■ • • , fiff) 

“L-" i; £••• S L ..-S (_i)'iW.-+., 

fp-o Kb+i-O ►’ 1“® 

(^0) (jl + Vl- 1; n). . • • (jV, + Vy, - jyt^l - Py,_l - 1 ] ^-y,) 

(iri + Vy^ - jy, “ Vy, “ 1; Vy^) ‘ < • (jp + Vp - jp-l - Vp-\\ Vp) 

T{jl 4- n, ■ • • ,jyt + yy^ — jy,_l ” Vyi-\, 0, ■ < * , 0, 

hi + V'ii ~ hi "*'?*» ’ ” )jp Vp ^ jp^i — Vp-i). 

Proof, We note first that there are terms in (69). Since 
(71) 


pOj.- ■■./,) . T P 

Cip+is' ' sfp) * ‘ ' ^ ^p+i ■ * 'ip)} 

Kj— /a Xi“*ii 


the theorem may be proved by a process of repeated summation. Ftom (67) 
and (71) 

M X^i \ |-X t k~ip 

(,«■■•(,) = E L E ••• E (-i)''+-+-+" 

I-I-O •J.p-O Kp— 0 

(72) 

(Xi + vi] j'i)(Xj + va — Xi -- fi; Ka) • ■ ■ (jp + Vp — pp) 

r(Xi + Xj 4- *^2 “ Xi — j»i, . . . , jp 4- Pp — jp_i — vp_i)- 

For fisfed values oi Xj , Xa , ■ • • there will occur jti (72) all terms 

(73) TUi + ft , Xi 4“ ii - ft , • ■ * , jp 4- J'p - jp-i - vp-i), 

(01 = 0, . . . , Xj - ji), (0 < j', < X.+1 — X,), (s = 2, ■ . . , p), 

(Xd+* = jp+i 8 = 1> • ■ - , p “ fif)) 

and any definite term (73) will occur in all 
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for which 

0 < a < i9i . 

In (74)j the definite term (73) will have coefficient 

(76) ■ . . (ip + yp - ip^i - Pp -1 , Vp), (a = 0, 1 , ■ ‘ , ^Oj 

(^1 = 0, • • ' f Xa — ii) . 

Hence, in (72) the definite term (73), will have coefficient 

(_!)/!,+..+ ... + ft _ 1; ft)(X, + _ ft ; p,) 

••■ (Ji + "v- jp-l - i 'p)i 

and 

(76) •/,>(!)' 

We now evaluate 


(77) 


p(/i/i) x ' p(/i) 

Xj-ii 


(78) 


For any fixed values of X3 , > • • , Xp , there will occur in (77) all terms 

7^(ji “t“ |3i t H + 1^2 — ii — > Xj + J'a — ji — ) 

■ ■ ■ j ip + J'p “ ip-i “ •'p-Oi 

for which either 0 < ^2 ^ X3 — ja j 0 < < ja — ji 1 or = jj — ji + y, 

0 < 7 < Xb — is ; 0 < /la < X3 “ is — 7- 

Let 0 < i8i < ia - ii - 1; 0 < <32 < Xb - is . Then the term (78) will occur 

in all 

(79) ■f*o'j+a,Xj,"-,ip)i 

such that 

0 a ^ i5a . 

In (79), (78) will have coefficient 

+ ft _ 1; ft)(j, + ft _ 3, _ ft _ 1; ft _ a) 

(Xs "1" .*3 — 5*3 0a i fa) ’ ' ' (jp 'i' Pp — jp-i — Vp-i J J'p). 

Hence in (77), (78) will have coefficient 

- 1; j9,)(ij + ^2 - ii “ /?! - Ij ^2) 

(Xa + vj — ia jSa ; pj) • ■ ■ (ip + Vp — jp-i — Vp-i ; >'p), , 

(j9i = 0, . ‘ . ,ia — ii — 1), (ft - 0, . • • , Xj - ia), 

(v, = 0, • • • , X»+i — Xi), (5 = 3, ■ • • , p) ; 

(Xp+. = io+.), (s - 1, • ■ • , P - 5) 


(80) 


(81) 
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Now let j3i = j 3 — ji + 7 ; 0 < 7 < ^3 — 1 0 < < Xs — ji — 7 . Then the 

term (78) will occur in all terms (79) such that 

y ^ oc < ^ , 


and in ( 79 ), (78) will have coefficient (80). Summing for or, (a = 7 , . * . , /Sj), 
we obtain as the coefficient of (78) in (77) 


and 


Hence 


0, 


if ^4 > 7, 


+ A - 1; A)(X. + h - A ; ►.) 

' * ‘ (jp “I" ^p — f ^p)j if ft ^ T' 


(82) 


•/,)(!, 1) + 8)* 


If we examine (82), we note that the result of summing with respect to Xj 
has been the replacement of (76) by two sums which are similar to (76) in that 
the next summation index, in this case Xs , occurs in exactly two limits of sum- 
mation. If it can be shown that, the two sums which joccur in (82) each result 
in a pair of sums after summation with respect to Xs , or more exactly if 


>1+1 


(83) ^i+l-Zr+l 


“ • • • j 1 ) + ^(Ki+|,''',^p>(l> ^3i • ■ ' j I 0) 


then the proof will be completedc-^ 

Since the truth of ( 8 ^) may be demonstrated in exactly the same way in 
which (82) has been shoWn to be true, the theorem is proved. 

CoEOLLARy. If ( 66 ) is true for all sets of possible values of , pa » ■ • ■ f pp 
then 


^1) ‘ ■ ^-rn Oj • • ’ j 0> » • ■ * ^ 9, • ■ • 0, • • • , ’ a O 

■=S' - E S E. ••• E (-i)'‘+ '+" 


ri-O 


Ul + J'l - 1; Vl) • • • ijyt + Vy, - Jyj-1 “ I'y.-l “ Ij Py^) 

(84) iiy^ + Vy, - iy, -Vy^-1] Vyi) • • • {jp + Vp - J Vp) 

(A:;il + Vl, ■ ■ ■ , + py^ - jy^_l - j^y,-l, jyi 

+ f'n ~ ht ~ Vyii ” ‘ yh + Vp - jp^i — I'p-i) 

■PWl-.-l(jl + ‘ J iy, + Vy, - Jy,_l - 0, ' ‘ , 0, 

hi '+ Pya “ jYi - >'y, y • • ■ r Jp + J'p - ji^\ “ Vp-i)]. 



THEORY OF COMPARATIVE STATISTICAL ANALYSIS 


176 


Let us again consider the experiments (19) and let have as possible results 
Oj, (j “ Ij • ' ' j ^)i (s “ 1, 2) (i 1, 2j • • • , r). 


Let 

(i = 1, ■ ■ • ,r), 

i.e. occurs whenever Oj-^^ occurs. Furthermore let the outcomes 


o[¥, oil\ • • . , oi^ 

be mutually exclusive. 

Let 

, 0,., 

occur if and only if none of 

oSl',o!?,...,oS? 


(» = 1, 2), 


occur. 

We may wish to know the probability that at least of On , • > • , Oh and 
at least ji , ji > ji , of 5ij , ^ 22 , ■ - ■ » Ow occur. 

From Theorem IX this probability is equal to 

(85) P*'-'’’ = P(l, 1) + P(l. 0), 


where 


KJ— 0 0 

(ia d- V2 - ii - Pi - 1; yiWji + Pi,ia + Pa “ ji - vi)j 
and 

B(l, 0) = (-l)'’(;i + n - 1; + vO, 

From (63) 

(86) T{ji Pi, j'a + Pz “ “ Pi) — S 2 

“b *'1 ) h + P2 “■ ji Pi)lj 


where, from (61) 


"b J2 + P* 3l — Pi) = n Oa,l H Oa^j , 


the subscripts 

(87) ai,ai, . . . , 
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being the first subscripts of the selection of ji + vi events of second sub- 
script 1 from 

Oil , O 21 ) • ■ ' > 0 * 1 , 

and the subscripts 


being the first subscripts of the selection of j 2 -f ^2 events of second subscript 2, 

ji + of which are (87), from 

O 12 , O 22 I • ‘ f 0*2 . 


It is easy to see that 

+ J'l; h + V2 ^ ji 


I- { fl+l"! 

'i)i = n 1 - E Pio'Jii) 

<=i L y-i 




Furthermore 

(88) r(ji -I- Ml) « 2 

where 

r ( Ji+ri 

f (fc(3i + V.)] = n a - £ p(o‘‘,’,)|. 

Substituting from (86) and (88) into (85) the desired probability is obtained. 
It may be remarked that theorems which have the same relation to Theorems 
VII, VIII, and IX that Theorems IV, V, and VI have to Theorems I, II, and 
III may be obtained without much difllculty. ; 
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REPLY TO MR. WERTHEIMER’S PAPER 
Richmond T. Zocii 


The attainment of rigor both in applied as well as pure mathematics is a slow 
process, and for this reason criticism of my paper, if constructive, is welcomed. 

Properties like continuity, differentiability, and dimensionality are local 
properties, that is to say a function may be continuous or differentiable over a 
certain range but not outside this range, or otlxerwise a function may be con- 
tinuous or differentiable over a given range except for singular points. 

The presence of singularities in functions does not necessarily cancel tlieir 
utility. Thus the function 7j ^ tan a; contains points where it is discontinuous, 
but ordinarily it is regarded os a continuous function and the presence of these 
singular points seldom handicaps one when working wth this function. Simi- 


larly, the function / = i — is a function which satisfies all four Axioms as 

M2 

stated in Whittaker and Robinson’s book and expresses the mode of Pearson's 
Type III curve as a symmetric function of the measures. Tlie fact that this 
function is not differentiable along the lino aji = 3:2 = xs = • • ■ - will never 
handicap the investigator for unless the frequency distribution is clearly skew 
the Type III curve would not be used to represent it. 

It seems that Mr. Wertheimer bases nearly all his criticisms on the tacit 
addition of the word everywhere'^ to Axiom IV as stated in Whittaker and 
Robinson’s book. The word “everywhere” is not in the statement of Axiom 
IV and I assumed nothing else than stated in the axiom. 

If one deliberately adds the word "everywhere” to Axiom IV then nearly all 
my criticisms of previous writers are incorrect, unfair, and unjust. Ho^vever, 
it does not seem that clearness and rigor in mathematics are increased by read- 
ing into an axiom a word that is not there. 

Consider first the criticism in my paper which remains valid even when the 
word “everywhere” is added. (Schimmack uses the word “everywhere” on 
page 127 although Whittaker and Robinson do not.) Botli Schimmack and 
Whittaker and Robinson proceed os at the top of page 217 of the book by the 
latter authors with the statement: “In this equation make fc — > 0 then each 


of the quantities 


L3a:„J 


tends to a value which is independent of the a;'s 


If 


T'his statement rests on the tacit assumption that the qu anti ties ^ are func- 


tions of k. Even if such were true the use of tacit assumptions in a rigorous 
proof is objectionable, but os a matter of fact these quantities are not functions 
of k. Thus the particular proof given in Whittaker and Robinson’s book as 
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Tvell as in Schimmaok’s paper is altogether lacking in rigor even when the word 
■'‘everywhere” is added to Axiom IV. Both Sohiaparelli^B and Broggi^a proofs 
.appear to be entirely rigorous if the word “everywhere” ia added to Axiom IV. 

In preparing my paper I assumed that no prohibition on functions which had 
.singular points Was contained in Axiom IV, In other words, I assumed since 
the word “everywhere” did not appear there was no valid objection to intro- 
duce and discuss functions with singularities. The functions I introduced are 
■everywhere continuous but they are not dijferentiahk along the line in Euclidian 
n-space defined by = icj = • • ■ - They are differentiable at every 
■other point in the space. 

It seems to me since Axiom IV as stated in Whittaker and Robinson's book 
-does not exclude functions which are not everywhere differentiable that all my 
criticism is fnir and just, and moreover nearly all my statements are correct. 
Mr. Wertheimer is entirely correct in pointing out that the words "everywhere” 
on page 181 of my' paper are contradictory. As a matter of fact the whole, 
paragraph beginning with line 7 on page 181 appears to me, on reexamining it, 
to be unsatisfactory. Except for this single paragraph I believe my paper to 
be rigorous, but I welcome further criticism. 

Mr. Wertheimer's conclusions in his paragraph number 4 are clearly errone- 
ous. To show this, conside*’ a function of Ai. As A; ^ 0 any one of three situa- 
tions may arise, namely; (1) The function may become infinite, (2) the func- 
tion may become indeterminate, that is it may take on any value whatever, 
(3) the function may approach a unique finite value independent of fc. Neither 
Scbimmack nor Whittaker and Robinson nor Mr. Wertheimer has established 
. as a definite fact that the particular type of function here in question approaches 
a unique finite value independent of A: as k 0. The truth of the matter is that 
this conclusion cannot be established because the function in question does not 
involve k either explicitly or implicitly. 

In conclusion there are two things I wish to emphasize. First, even when 
the word “everywhere” is added to Axiom IV, the proof given in Whittaker 
and Robinson's book is faulty, but if one consults the references given there 
in the footnotes he will find two other proofs which are rigorous with this ad- 
dition to Axiom IV. Second, the mode of a skew bell shaped Pearson Fre- 
quency Curve satisfies all four axioms as stated in Whittaker and Robinson’s 
book, and the fact that these expressions for the mode are not differentiable 
along a certain line is never a handicap to the statistician. 
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CORRELATION SURFACES OF TWO OR MORE INDICES WHEN THE 
COMPONENTS OF THE INDICES ARE NORMALLY DISTRIBUTED 


By Geobqb a. Bakeh 


Indices are widely used in statistical analyses/ In many cases incorrect 
conclusions are drawn because indices are not uncorrelated or independent even 
though all of the component variables are independent. In a previous paper* 
the distribution of an index both of whose components follow the normal Jaw was 
given exactly i.e. without approximation. The purpose of the present paper is 
to give the simultaneous distribution of two or more indices when each of tlie 
components follow the normal law. The case for two indices will be discussed 
in detail and the extension to more indices will be indicated. 

Let % j and icz , be correlated variables each bemg normally distributed 
about their respective means mi, 11^^771$, with standard deviations ai,<rsf trs, 
and let the correlations between the variables in pairs be represented by ru , 
^13 , rig . Then the simultaneous distribution of these tliree variables will be 


1 

(2T)lEVi(T8£ra ^ 


1 1 rii!ii{a;i “ miY Ruixi - Wi)* ' ^33(^3 - Wa)* 

no J ‘ 2 2 

(Tl. ffs 


( 1 ) 


^ fa - 


4, 2 Ei 3 fa ~ ^ 

7105 


"h 2 Ri!3 


- mi)^ - wj)' 


O’! (73 




where 


H = 


1 rii fis 

ri3 1 ris 

r]3 rij 1 


and Rif are the respective second order minors of R. 


1 Rietz, H. L. "On the Frequency Distribution of Certain Ratioa," Annals of Mathe- 
matical Statistice, Vol. VII, No. 3, Sept, 1936, pp. 146-163, 

2 Baker, G, A,, “Distribution of the Means Divided by the Standard Deviations of 
Samples Prom Non-homogeneous Populations," Annala of Mathematical Statistics, Feb. 
1932, pp. 3-6. 
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If we make the transformation 


- 

Zl = — , 

Xs 

Xi 3 = Z1Z3 

Zs SS — 

Xa 

X2 = JZ2Z3 

Za = Xif 

JTa = Za 

dXidXidxs = 

2 a dzi dzi dzs 


which is certainly valid if Xi,Xi,Xi, are all positive, then (1) becomes 

1 1 [* RiiiziZs — mif 




2 ni 


i 


<r 2 


{ 2 t)^R^ (ri<ri(rs 

^2^ _J_ fi38(g3 - mi? (gigs - mi){ztZz ~ frh) (ziza - miXzj -- m^) 


^3 


cri<r2 <7 i<T3 

(ziZi — nh){zi — W 


22^23 ' 


0*2 (Ta 


- gj dzi 


dzf dzi r 


H Xi , aca , X 3 are all positive the corresponding diattibntion of Zi and 22 can be 
obtained by integrating (2) between the limits 0 and <» with respect to Zi . 

If a5i, iCa, Xi are all negative 2i and Zj are again both positive so that in order to 
get the total distribution for Zi and Za it is necessary to add to the integral of (2) ' 
between the limits 0 and » with respect to Za the similar integral of (2) with za 
replaced by — za . The result is 

f Vs -3*3 j I 6^ ^/tt 
“V “T - 1 / ' e ” dz -1- — j- -5— 

\/2 a* a Jo 0 v2_ 

a= — *!+-;■ 'T*“T"b ^ 1^2 T d 

(Ti (fa c^'1^2 cri(r3 (r2^?'a 


(3) 

where 


1 ^ 


(2ir)W 


tricr2 0*3 


b = 


■Bn ^ , B23 I , Baa ■ 

—5- jniZi H — ^ WI2Z2 d — j- + 

<ri 0's ca 


^ *M _L Ml i4 L M. 

ZiWla ~r WI 1 Z 2 "T WaZi 

O'! Os o'itf'2 O'! era 


d ^ wzi d“ — “ WI 3 Z 2 + 


O'! (73 

, -Bii ^2 , -Bss a , -Baa s' , 2 i£is 2 Bia , 2B2a 

^ — 2" d — 2 "b — ^ JRa d“ W1WI2 d“ : Wi JJta d“ - — “ ttlsJtta • 

<J'i Vs ffa triff2 viva vsoa 


■Baa 

csva 


Baa 

vava 


The, same result (3) is obtained for Zi , and Zs negative, z^ positive and Zs 
negative, Zi negative and Zs positive. That is (3) is the simultaneous distribution 
of zi and zs. The extension to more than 2 indices is immediate. The form of 
the distribution of the indices and the denominator variable is the same aa (2) 
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except that a, 6 , and c, the coefficients of zl , 23 and the constant term respectively 
in the exponent of e, will be different in that they will include the new indices and 
the exponent on the denominator variable will be the same aa the number of 
indices involved. The distribution of the indices will again be obtained by 
integrating from 0 to « with respect to the denominator variable. 

The case when all of the variables XijX^,X 3 are independent is especially 
interesting. If , na , are all zero then i? = iJu = = Jfjj = 1 , sr= 

= i? 2 a = 0 and a, &, c, become a', }}\ o’, respectively, 


1 

iTj (Tg 


% 


V = 


miZi I jn2 22 ms 
2 T" 8 ' a 

ff2 ffs 


2 8 2 

. mi , m 2 ms 

^ — 2 ^ 2 

(fl 0*2 


Under these conditions and the further condition that , wta , m 3 are large with 
respect to cri , va , 0-3 respectively so that the integral term of (3) maybe neglected 
(3) becomes 


/mici . I7l4Cq 



It is clear that 2 i and 22 are not independent in the probability sense for dis- 
tribution (4). 

The question as to the possibility of having the variables independent and the 
indices independent at the same time arises. Denote the distribution functions 
of aji , iC 2 , a; 3 , by Xi(xi), X^ixi), Xs(zs) and of 21 , 22 by 2 i( 2 i), 2 a(z 2 ). Then, if 
Xi > 0 , f = 1 , 2 , 3 it is necessary that 

(6) [ Xi{z3Zi)X2{z3Z^Xs(zs)zs dzs = Zi(zi)Zs{z^ 


a and h being suitable limits. 

For instance, let 

Xiitcd = i, 

a;? 

= i, 

3:2 

Xsixl) =. xl 


1 < iEi < 3 


1 < aja < 3 


1 < Si < 2 
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then 


2l 

^3(23) = 

2S 

for value of Zi and 22 within a straight line aided area the corners of which are 
{h i)i (^j l)i (Ij 1) (If 2). Zij and Zz are not uncoirelated throughout their 
entire set of values but are for this particular set of values. Thus is appears 
that it is possible that the indices may be independent when the variables are, 
but not necessarily so. 

Indices should be used with care since it is very easy to draw invalid conclu- 
sions from the consideration of them. Usually it is better to use partial corre- 
. lation analysis to remove the influence of a third factor than to calculate indices. 



THE TYPE B GRAM-CHARLIER SERIES 
By Leo A, Aeoian 


While much attention has been devoted to the Type A Gram-Charlier series 
for the graduation of frequency curves, the Type B series has been somewhat 
neglected. However the numerical examples to be presented later will show 
that the Type B series is very useful for the graduation of skew frequency 
curves. Wicksell^ has demonstrated that the Gram-Charlier aeries may be 
developed from the same law of probability which forma the bnaia of the Pearson 
system of frequency curves. Rietz^ following Wicksell gives a derivation of the 
Gram-Charlier series based on the binomial (q -f p)". Jordan® gives a method 
for fitting Type B based on certain orthogonal polynomials which he calls G. 
He uses factorial moments because of the resulting ease in finding the values' 
of the constants. 

We shall consider the Type B series for a distribution of equally distanced 
ordinates at non-negative values of a:. We shall find the values of the first few 
terms of the series and shall also shew how the values of later coefficients may 
easily be found. We write the Type B series in the form 

(l) F{x) = Cfl "k CiA^(a;) 4- C3A®^(x) + C5A®i^(x) + CaAVC^r) 

where 


( 2 ) 


\p{x) = 


e M 


xl 


m = )ji[, the mean, 


A^(x) ^ ^(x) - \/^(x - 1) for X = 0, 1, 2, ■ ■ • s. 


Let f(z) give the ordinates of the observed distribution of relative frequencies, 
SO that 2/(x) = 1. To determine the coefficients co , ci , Ca , • ■ . , co , we have, 
using the method of moments, 

S[co^(x) 4- CiAi/'(x) 4" ciAVW "k 4" 4- cbAVW] - S/(x) = 1. 

2x[co^(x) 4- CiA^W 4- ■ 4''CflAV(a:)] = Sx/(x) = m, 

2x^[co^(x) 4" CiAip(x) 4* 4- C6AV(i5)] — = fi2 ■ 

1 

(3) 2x®[co^(x) 4- ' 4- CeAVWl - Sa:®/(x) = 

2x*M(x) 4^ 4“ CaAV^(x)l = 2xV(x) = ► 

2x®[co^(x) 4“ 4"^^ ~ 2x/(x) s= /i 5 , 

2xW(x) 4- + CflAV(a;)] »= 
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Hence we must find the values of 





= 0, 1, 2, 3 • ■ • 
p = 0, 2, 3 • • ■ 


defining AV(a:) = '/'(a?), We assume that we are dealing with distributions 

flO 

in which s is large, and that the error involved in substituting S for 




_a 

2 is negligible. To find these summations in a straightforward 




manner would involve too much labor, so we shall briefly discuss some properties 


e M 


of the generating function, = — ■ — , the Poisson exponential, very useful 

X I 

in the graduation of frequeney distributions of rare events. The first eight 
momenta about the origin are : 


/ij = 1 /ij ts w - fii m -jr ^ 2a;V(a?) 

= m + 3m* + m* = SxV(») 

/ij = m + 7m* + 6m* + m* — SxV(®) 

(6) as - m Tf 16 m* + 26m* + 10m* m* =* SxV(x) 

a! = m + 31m* + 90m* + 66m^ + 15?/? + m* = SxV(a:) 

a! = m + 63m* + 301m* + 350 + liOm* + 21m® + m* = SxV(x) 

aJ = m + 127m® + 966m* + 1701m* + 1060m® + 256m® + 28m^ + 

=t Sx®^(x) 

These may be found by the formula given by Jordan,® 


(6) 

Proof ; 


- n(,: + ^). 

#(a?) _ #(x) _ , . V 
dm m ' 


We multiply by x" and sum, giving (6). This result may readily be proved also 
by means of recUTsion formuias without differentiation. Now we must find the 
values of 


£ 


9 


x"A*’^(x) 


We do this by proving 


=5 0, 1, 2, ■ • • 

'p =* 1, 2, 3, ' * • 


( 7 ) 




E E I'i'iKi). 


{C^OO 
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Now 

(8) = ^p(x - 1) - )p(x) = -Ai//(a;). 

Hence 

+ (2)^* - 2) + . . . + (-l)V(i - s)], 

since AV(®) = 4'i^) — - 1) + - 2) + • • • + (-l)V(j: ^ s). 

Then by (8) 

- 1) - Hx) - - 2) + - 1) 

+ - 3) ~ (^2)'*'^® “ + ■ " + - 4 - 1) 

- (-1)V(* “ «)j- 

(9) ^ AVW = -^{x) + (I” I ^y(x - 1) - 2 - 2) + • . . 

- (-l)'+(i - « - 1). 

= - - (* t 'K* - + (' 2 '^(® '- 2) + • ■ • 

+ (-1)V(* - s -* 1)1. 

= -A'+V(*). 

We multiply (9) by z^, sum with respect to x, giving (7)’. 

Thus by use of (7) and (6) we get: 

SAV(a:) = 0, p = 1, 2, 3, - . . 

= -1. 

(10) + 

2a;®A\^(a:) = — 3m^ — 6w — 1. 

^x^A^{z) = —W — 18m^ - 14w — 1. 

23a!®Ai/'(®) — -'5m^ — 40m^ — 76??2^ — 30w — 1. 
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2a:V(a:) = -6m® - 75m' - 260m® - 270m'.- 62m - 1. 
l^xA^Hx) = 0, Sx'aXo:) = 2, l:xW^|^{x) = 6m + 6. 

XxW^p(x) = 12m' + 36m + 14. 

Sa;®AV(*) = 20m® + 120m' + 150m + 30. 

XxW4/{x) = 30m' + 300m® + 780 m' + 540m + 62. 

, Sa;A®^(a;) = 0, 2xW^P(x) = 0, ■SxW4^(x) = -6. 

1:xWH^) = -24m - 36, 2xVi(x) = -60m' - 240m - 150. 

2xW\l/(x) = —120m® — 900m' — 1560m — 540. 

(10) Sa;A'^t(a;) = 0, Soj'a'i/'Cx) = 0, Sx'aV(x) = 24. 

2a;®A'f (.t) = 120m + 240, 2a:®A'^(a:) = 0. 

Hx^A^ix) = 360m' + 1800m + 1560. 

2xA’’iix) = 0, 2xA^^p{x) = 0., 

l^xWiix) = 0 , IxW^ix) = 0 . 

ZxWi/{x) = 0, 2x®A'iA(a:) = 0. 

7:xWfix) = 0, 2xWHx) = 0. 

-ZxA'‘f{x) = -120, 2a:®A®i/^(x) = 0. 

2x'A®r/'(x) = -720m - 1800, 2xW<p(x) = 720. 

Finally we substitute from (5) and (10) into (3), and for we substitute 
Mn = 2 Heuce 

Co = 1 
Cl = 0 

C 2 = Hm 2 - m). 

(11) Ca = — ~ (/xs 3/X2 + 2m). 

C 4 = 6 /X 3 + ^ 2(11 6 m) + 3m(m — 2)]. 

C 5 = — 10ju4 — ju3(10m ~ 25) + 50^2 (m — 1) — 4m(5m — 6 )]. 

ce = ^ [m 6 - 15;^6 + iij(85 - 15m) + /laClSOm - 225) + M^5ni‘ - mtn 

+ 274) - 15m® + 130m' - 120m]. 

It may be asked whether criteria may be given as guides for the use of Type B. 
In general Type B may be tried if either the skewness of the distribution to be 
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fitted is considerable, ag = > ,6, or if m = jua = Ms approximately. The 

M2 

latter condition strictly would mean that alone is sufficient for a good 
graduation, if the fourth moment, ^ 4 , is not used. The examples which follow 
are arranged to facilitate comparison with the Pearson system of frequency 
curves. We have an example each of Type I, III, IV, V, VI, and an example of 
the normal curve. 

Type I. Table 1. Here as > .6 although m /jt 2 Ma ■ The first four 
moments, unadjusted, give an excellent fit by Type B, which is not quite as good 
as Type I. The degrees of freedom, according to Fisher,^ have been taken into 
consideration here in applying the test. The two classes 13, 14, were grouped 
together for the x^^ test. The actual numerical work is easily done on a cal- 
culating machine, although logarithms are necessary to find the value of e“'". 
This example and the remaining are all taken from Elderton^ with the exception 
of Type IV which is from A. Fisher.® 

Type III. Table 2. The unadjusted moments are used. Here as — 2,0833 
> .6, and m Ma approximately. The fit by Type B is slightly better than that 
by Type III. • We have for Type III P(x^ > 12.8) = .007, n = 3, while for Type 
B, P{x^) > 9,4 = .025 n = 3. Moreover the standard error of prediction for 
Type III is 11.2 and for Type B is 7.7. 

Type IV. Table 3. The rough moments were used. Although ag = .48 < .6, 
Type B gives a fine fit since m = jLt 2 = ms approximately. Here the results are 
given for Type B using 2, 3, and 4 terms of the series. This was done to show 
how the distribution changes with the addition of more terms. The superiority 
of Type B over Type IV is evident. The results for Type IV are taken from the 
class notes of Professor C. C. Craig. 

Type y. Table 4. Using the adjusted moments we have a comparison among 
Types V, A, and B. While the graduations may seem satisfactory, the ix? test 
shows that the fit is poor in each case: The order of merit is Type V, Type B, 
and then Type A. The negative frequencies which appear in Type B may be 
due to the use of the adjusted moments. If we use the rough moments, the 
negative frequencies disappear. On the whole the fit by means of the adjusted 
moments is superior. 

Type VI. Table 5. Type VI using the adjusted moments gives an excellent 
fit. Even though ag is considerable, and mz = Ms approximately, four moments 
with Type B give a poor fit, and five moments, adjusted, achieve a very small 
gain. Five moments using the unadjusted moments give some improvement, 
but the — 2 frequency in the first class is objectionable. 

Normal Curve. Table 6. The normal curve provides a fine fit. P{x? > .9) = . 
.96, n = 6. The first two and the last two classes were grouped together for the 
test. The fit by Type B is less probable, P(x^ > 8) = .15, n = 5. Type B has 
two discrepancies, the negative frequencies, and the fact that the total fre- 
quences (neglecting thO —1) is 352. That Type B does so well is in itself 
quite amazing! 
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TABLE 1 


X 

Actual frequency 

Frequency computed 
by Pearson Type I 

Frequency given 
by Type B 

0 

34 

44 

42.4 

1 

145 

137 

121.3 

2 

156 

149 

168.7 

3 

145 

142 

156.8 

4 

123 

127 

120.5 

5 

103 

108 

94.9 

6 

86 

88 

82.9 

7 

71 

69 

72.2 

8 

55 

51 

56.7 

9 

37 

36 

38.0 

10 

21 

24 

23.1 

11 

13 

14 

12.0 

12 

7 

7 

5.7 

13 

3 

3 

2.4 

14 

1 

1 

.9 

m = 4.175 

aa = .712247 

Type I Pix^ > 

4.36) = .88 

M2 = 7.66237 

<14 = 2.95214 

n (number of degrees of 

M3 = 15.1069 

ca = 1.74368 

■freedom) 

= 9 

M4 = 173.326 

C3 = - .078298 

1 Type B P(a;2 > 9.67) = .37 


C4 = + . 094592 


71 = 9 


Fix) = fix) +1.74368 A^fix) - .078298 A^fix) + .094592 A*fix). 


TABLE 2 
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TABLE 3 


Number of alpha particles from a bar of polonium in intervals of | of one minute 


X 

Frequency 

Type IV 

Type B 

2 terms 

TypeB 

3 terms 

TypeB 

4 terms 

0 

57 

50 

49.5 


58.2 

1 

203 

183 



199.8 

2 

383 

392 



386.1 

3 

525 

544 

532.3 

533.8 

523.9 

4 

532 




532.1 

5 

408 

417 



418.2 

6 

273 


254.8 

254.4 

260.2 

7 

139 

131 

137.1 


134.0 

8 

45 




56.7 

9 

27 

26 

26.1 


22.9 

10 

10 



9.6 

8.6 

11 

4 

4 


3.1 

3.6 

12 

0 

1 

.9 

.9 

1.6 

13 

1 

0 

.2 

.2 

.8 

14 

1 

0 



.3 


w= 3.87155 aa=. 47844 
Ui= 3.69477 = 3.506536 

3.39791 
ixi s 47.86888 

Fix) = ^{x) - .08839A2^(a;) - .00930AV(®) + .168lOAV(a:). 

Type B; 4 terms P{x^ > 4.50) = .72, n = 7 
Type IV P(a:'>10.8) = .15,n = 7 
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TABLE 4 


Mortality Among Female Nominees 


X 

Dea’’h3 

Elderton 
Type V 

Type A 

TypeB 

2 terms 

TypeB 
, 3 terms 

Type B 

5 terms 

TypeB 

5 terms 

0 

4 

4 

2 

1.4 

-6.9 

-.4 

4.1 

1 

18 

10 

15 

26.3 

7.1 

9.4 

13.1 

2 

53 

80 

78 

109.7 

100.1 

84.6 

77.4 

3 

265 

261 

235 

248.3 

268.4 

252.3 

242.5 

4 

438 

441 

426 

379.5 

418.8 

425.9 

427.4 

5 

525 

480 

521 

432.7 

461.0 

484.0 

494.1 

6 

342 

381 

411 

388.8 

388.4 

402.6 

408.1 

7 

253 

247 

225 

285.4 

263.5 

259.0 

253.9 

8 

128 

137 

107 

170.8 

145.5 

132.2 

124.9 

9 

82 

68 

66 

84.3 

68.3 

58.6 

54.1 

10 

28 

32 

44 

32.9 

28.2 

26.2 

26.4 

11 

12 

14 

22 

8.6 

11.0 

13.9 

16.4 

12 

8 . 

6 

8 

-.01 

4.7 

8.2 

10.7 

13 

6 

3 

2 

-2.1 

2.1 

4.3 

5.9 

14 

1 

1 

0 

-1.5 

1.3 

2.0 

2.5 


Adjusted moments: 
n = 5.30435 a, = .703564 

fii = 3.573345 ai = 3.996196 

w = +4.752437 

in = 51.02659 

in = 193.439125 


Rough moments: 
m = 5.30435 
^2 = 3.65668 
t)3 = 4.752437 
Vi = 52.85276 
Vi = 197.39949 


Type A: j{t) = ^{t) + .117261 + .041508^j\<) 

Type B: F{x) = i^{x) - .86550AV(2!) - .77352AV(a:) 

+ .02814AV(a:) + .57459AV(a:) 

Using uncorrected moments 


TypeB: F{x) = i^{x) - .82384AV(a:) - .73185AV(a:) 

+ .03192AV(a:) + .94033A'i^(a!) 

(last column above) 
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TABLE 5 


X 

Frequency 

Type VI 

TypeB 
i terras 

TypeS 

5 terms 

0 

1 

1 

-9.5 

-2.0 

1 

56 

50 

83.2 

69.9 

2 

167 

168 

141.6 

143.1 

3 

98 

100 

102.3 

110.7 

4 

34 

36 

41.5 

40.2 

5 

9 

10 

8.7 

4.6 

6 

2 

2 

.05 

2.0 

7 

1 

.5 

-.4 

1.0 


Corrected moments: Rough moments: 
m = 2.402174 m = 2.402174 
M2 =.928835 jLi2 = 1.012169 

Ms = .893096 M3 = .893096 

M4 = 4.088800 Ml = 4.313176 
M6 = 11.28304 
as = .87704 
ai= 4.2101 

Type B, adjusted moments: 

F{x) = ^|^{x) - .73667AV(«) - .48516AV(x) “ .06424AV(a;) + .10365AV(x) 
*Type B, rough moments: 

= ^(a;) - .69805AV(:r) - .44654AV(a;) - .06587AV(a:) + .16165AV(a:) 

* This is used in last column of above. There is a slight error here, which however will 
not affect the results materially. The third decimal place may be slightly wrong. 
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TABLE 6 


Normal curve 


X 

Frequency 

Normal curve 

Type I 

0 

.6 

.6 

2,3 

1 

2.8 

2.7 

4.7 

2 

11.5 

10.9 

8.7 

3 

27.7 

30.1 

25.2 

4 

59.1 

68.4 

55.2 

5 

84.7 

80.1 

79.5 

6 

74.1 

76.9 

80.1 

7 

50,5 

52.2 

58.1 

8 

23,2 

25.0 

29.7 

9 

12.2 

8.4 

8.6 

10 

1.3 

2.4 

-.9 


Moments corrected; 
m = 5.393443 
fi2 = 2.769635 

/ia == .029805, M4 = 22.40663 
as = .0064 
0=4 = 2.920997 


TypeB: Fix) = ipix) - 1.31l9AV(a:) - .4179A3^(x) + 2.1625AV(a:) 

Colorado State Oollbqjb 
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A TEST OF A SAMPLE VARIANCE BASED ON BOTH TAIL ENDS OF 

THE DISTRIBUTION 

By John W. Feetiq 

With the assistance oe Elizabeth A. Proehl^ 

(1) Introduction 

In testing the hypothesis, say Hq, that an observed sample E of size N has 
been drawn from a normal population for which the standard deviation, cr, has a 
particular value, o-q , one may form the ratio 

(I) 

if the population mean m be known, or 

v' ^ S {Xi -xT/al = ^ (II) 

t=i ffo 

where x is the sample mean, if the population mean be unknown. The proba- 
bility of obtaining a larger (or smaller) value of v or v' than that observed may 
readily be obtained from the appropriate tail area of the distribution with 
n = W or n = (JV — 1) degrees of freedom respectively. The alternative 
hypotheses to Ho concerning the normal populations from which the sample 
may have been drawn assign different values to c and form a set of hypotheses, 
0. The members of Q may be classed according to whether they specify 

(T > ffo , or (T < (To • The practice of regarding only one tail of the distribution, 

the upper or lower depending on whether v > JV or v < iV, is tantamount to 
accepting as admissible alternatives to Ha only one of the classes of 
The alternatives may sometimes be limited to one class or the other through 
some a priori knowledge, or the problem may be such that only one of the classes 
is relevant. However, since this is not generally the case, some method of 
considering all of the alternatives is needed. When testing hypotheses con- 
cerning the mean of the sampled population, the problem is quite simple, since 
the distribution of means is symmetrical. Thus, the "corresponding” value to 
any positive deviation, {x - w), is the negative deviation of the same magnitude. 
Merely doubling the tail area pertaining to either of the deviations will serve to 
take account of both classes of alternatives, i.e., those in which m > m and 
those in which m < ma. The problem is more difficult in the case of u or w'. 


' From the Memorial Foundation for Neuro-Endocrine Research and the Research 
Service of the Worcester State Hospital, Worcester, Massachusetts. 
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•since the distribution is not symmetrical In addition to the value of v or v' 
pertaining to the observed sample we require a ^^corresponding^^ value at the 
other end of the distribution^ The definition of “corresponding” which is 
accepted will determine the required value* There may be a number of such 
definitions but not all of these will be equally acceptable* The value of v 
which delimits an equal tail area specifies one of the possible definitions of 
“corresponding.” Another definition would require that the ordinates at the 
two values of v be equal. 

The Neyman and Pearson Approach. Generalized procedures for. testing 
statistical hypotheses have been elaborated in recent years by J, Neyman and 
E. S. Pearson (1-5). These have considerable philosophical appeal and will be 
traced as a basis of solution of the immediate problem. A test of a hypothesis 
Hq consists essentially of a rule for rejecting Ho when the observed sample E 
falls within a suitable critical region w of the AT-dimensioned sample space Wj 
and of accepting Ho when E falls in (IF — In testing any hypothesis two 
types of error may be made : 

i) Ho may be rejected when it is true; 

ii) Hq may be accepted when some alternative hypothesis, Hi , is true. 

Errors of the first kind may be considered “equivalent” since, if a true hypoth- 
esis is to be rejected, it is immaterial which one is chosen. Furthermore, the 
first type of error can be controlled through our choice of the size of Wj say a. 
The size of represents the probability of a sample E being an element of w 
when the hypothesis Ho is true. This probability may be designated briefly as 
P{E€w\Ho], Then 

P{Etw\H,] = j f dxxdx2 • ‘ ' dx}f = a (HI) 

where p{E | Ho) is the elementary probability law of the sample when Ho is 
true, i.e., 

p{E 1 Ho) = p{xi jX2j Xif\ Ho) (IV) 

Errors of the second type, however, are not equivalent, since their consequences 
depend on the difference of the true hypothesis from Hq . The utility of a test 
of Hq will depend largely on how it controls the second type of error. Ideally, 
the selection of a critical region should take into consideration the probabilities 
d priori of the hypotheses composing Since these probabilities are generally 
unknown, tests may be sought which are valid independently of them. 

A distinction must be made between simple hypotheses which specify com- 
pletely the elementary probability law of the sample, p{E)y and composite hy- 
potheses which specify the law subject to one or more undetermined parameters. 

(2) Simple H3rpothesis Concerning Population Variance 

A test based on a critical region Wo may be called independent of the probabili- 
ties d priori of the alternative hypotheses if it is more powerful than any other 
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equivalent test for all of the alternative hypotheses (3). An equivalent test 
Is one based on a region wi of the same size, a, i.e., 

P\E e I Hq] = P{B e Wx I i/g) = a (V) 

The power of a test based on any critical region, as Wi , is the probability of its 
rejecting a hypothesis Hq when some other hypothesis Hi is true. That is, 
it is the probability of E falling in wi when Hi is true. Denote this power by 
P{E ewi\ Hi}. The greater the power of a test, the smaller the risk of the 
second type of error. If tests as defined above exist, they minimize the proba- 
bility of the second type of error. Furthermore, the probability of the first 
type of error is no larger than a. Neyman and Pearson (2) have designated 
regions satisfying this definition as Best Critical Regions for testing Ha with 
regard to the set Q. If there is no such Best Critical Region, some compromise 
region must be chosen. 

A necessary and sufficient condition for wq to be a Best Critical Region with 
regard to an alternative Hi is that within v)q 

p{E\Ha) < kp{E\Hi) :,(VI) 

where k is some constant depending on ct. If this inequality is true for any Hi , 
wq will be a Best Critical Region for the set 

Neyman and Pearson (2) have shown that in testing the hypothesis that 
O' = o-Q , when the population mean m is known, there are two Best Critical 
regions, one pertaining to the class of alternatives for which o- < o-g and defined 
by y < 2 ^ 1 , the other to the class o- > o-g defined by > wa • Vi and vz are values 
of V so chosen that the size of the critical region shall be a. Although there is 
no Best Critical Region for all of the alternatives, the choice of a compromise 
critical region should still depend on its control of the second source of error, 
that is, on its power for the various alternatives (4). Such a compromise 
region may be designated as a Good Critical Region. What is needed is a 
region lOg of size a defined by the inequalities y < ui and v > Vi. If ri and V2 
are taken as the values cutting off equal tail areas, then the power of the test 
will be less than a for some values of a less than o-g . For those values of o-, Ho 
would be accepted more frequently than if it were true. Thus a first require- 
ment for a Good Critical Region is that its power should nowhere be less than a, 
the value when Ho is true. Of all such unbiassed Critical Regions of size a, 
Wq should then be selected so that its power is everywhere greater than that of 
any other equivalent unbiassed region. 

Critical Regions sufficiently satisfying the above requirements can often be 
obtained by stipulating that the first derivative of the power function with 
respect to 0, the parameter under consideration, shall be zero at ^ = 0o , and 
that the second shall be a maximum there. Then not only does .the probability 
of the second source of error decrease as we move away from do , but it decreases 
most rapidly in the vicinity of Oo . Critical Regions satisfying these conditions 
are called unbiassed Critical Regions of Type A, (4). Under certain assumptions 
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conceraing the nature of the elementary probability law fiJE j B) it can be shown 
that iwo is defined by the inequalities cpi < c\ and ipi > C 2 where Ci and ci satisfy 
the conditions 



/ pivi) d;pi = 1 — a 

Jci 

■ (VII) 


rc2 

/ ipi?>(ipi) dtpi = 0 

Jci 

(VIII) 

where 

d log p{E 1 0 ) 

do 9=^6^ 

(IX) 


and ^( 0 ) is the distribution function of v?i . 

In applying these results to the testing of the hypothesis that a = ai when 
the population mean is known, 

= (i; - iV)Ao (X) 


Obviously p(t'), the distribution of v, may be considered instead of is 

defined by the inequalities v <vi and v >v^ where 


dv + i p(y) dv = ai -j- a 2 ^ a 


rvi 

I P(v) < 

J rv2 

I (v — N)p(v) dv = 

VI 


v^n-vj^ 


= 0 


..(XI) 

.(XII) 


Wq so defined is also of type 4i, that is, its power curve lies everywhere 
above that of any other equivalent region, vanishing in the first derivative at 
(T = CTo , (4). 

The use of -yjo as the appropriate critical region is equivalent to the use of r 
as a test criterion, where 


= r)“- - XIII) 


That is, a value of v yielding the same r as the observed v may be taken as the 
corresponding value* Reference to the appropriate tables and summing of the 
two tail areas gives Pr , the probability of obtaining a smaller value of r when 
ffo is true. Hq may be rejected if P^ is less than some previously fixed number, 
say a* If the distribution of r could be evaluated the necessity of dealing with 
two values of v would be obviated. 

The criterion r is equivalent to that deduced by the use of maximum likelihood 
ratios (6). Thus, 


y 

(xx-m)2/2ir2 

p(E\a^) = (27ro’^)~'^^^e 


(XIV) 


* The solution is the same in terms of o'®. 
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Maximizing p{E ] <r^) for fixed E and all possible we have 

Pm.x.(-E I ff') = A"""" 2 t S {Xi - m)“J (XV) 

(XVI) 

(XVII) 


^ p(Jgkg) — at-'A(/ 2 Ar/2 
Pm.x.(filff“) 

^ 



The A**" moment coefficient of X about zero, mUx), is given by 

SN 0 - + h) 

2 


mI(x) 


r 


r(iV/2) 


(2e/X)^'"^ (1 + (XVIII) 


Probability that a sample has been drawn from a normal population with a specified variance or standard deviation 

Degrees of Freedom, n 
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For N inlBiiiite, (-21og,X) will be distributed as with one degree of freedom. 
For finite values of iV, however, we have not been able to evaluate the dis- 
tribution of X, although the distribution of the Incomplete Beta Function serves 
as a good approximation. Approximate distributions for several values of N 
have been obtained. P ^ , the probability of obtaining a smaller value of X 
than that observed, as obtained from these distributions agrees well with the 
sum of the tail areas pertaining to Vi and V 2 yielding the same value of X (or r). 


The construction of tables is simplified by taking (1) 

logio X = i\^/2(logio e - it) (XIX) 

That is, 

X - log^ X = k log, 10 (XX) 


where x = v/N, Equation (XX) is independent of N and may be solved once 
and for all for x, given fc.® In Figure 1 is plotted the graph of equation (XX). 
For convenience, the branch of the curve giving the roots greater than unity 
has been folded back with altered scale from the minimum value of k, logioe, 
occurring at x = 1. Table I was then constructed by multiplying the two 
values of x for a given k by {N/2)\ referring to the Tables of the Incomplete 
Gamma Function (7) with p = (iV — 2)/2, and adding the resulting two tail 
areas. The values for the odd numbers above 12 were obtained by interpolating 
between the even numbers. For N = 1, (x)^ was used as a normal deviate. 
The values in Table I should be correct to four decimals. Table I is entered 
with the number of degrees of freedom, n, on which x is based. In the case of the 
simple hypothesis this is N. 

The following may serve as an illustration : Blood urea nitrogen determinations 
(mg./lOO cc.) were made on a sample of 25 schizophrenic patients. The mean 
was found to be 15.56, the variance, 10,486. Previous investigation of blood 
urea nitrogen on a large sample of normal control subjects gave a mean of 16.03 
and a variance of 20.268, which for the purpose of the example may be considered 
as the population parameters. Then we may wish to test the hypothesis that 
the variance of the sampled population, (/ , is = 20.268, knowing the mean 
of the sampled population to be 16.03. Calculate 

^ + = .528 

Referring to Fig. 1, the value of k is about ,505. Turning to Table I with 
k = .605/ 71 = 25, P is found to be .0457. We should thus be inclined to reject 
the hypothesis. 

For N small, the area of the tail of the distribution near zero is considerably 
larger than that at the upper end. As N increases the distribution of v becomes 


’ If the solution were explicit the distribution of X could easily be deduced from that of x, 
* k obtained directly from {XX) is .507, corresponding to P = .0427 . 
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more and more symmetrical and the two areas approach equality. Even for 
N = 50 , however, they are rather unequal, so that merely doubling the area 
pertaining to the observed v does not give a sufficiently accurate approximation. 
For > 50 an approximation correct within several units in the third decimal 
place may be obtained by taking ^y 2 N{'\/x — 1) as a normal deviate. This 
assumes that the standard deviation is normally distributed with variance (tI/ 2 'N, 

(3) Composite Hypothesis Concerning Population Variance 

Here i^o specifies^nly the value of the parameter 0 = , leaving undetermined 

the value of a second parameter, r. Thus, Ho consists of a subset, w, of simple 
hypotheses, each of which specifies a different value for v. Any simple hypoth- 
esis specifying different values of both parameters, d and is an alternative 
to Ho . These alternatives form the set 12. The elementary probability law 
determined by Ho is | Ho) = 1 while that determined by an alterna- 
tive hypothesis Hi is p(H | = p{E | In testing composite hypotheses 

the first requirement is to find regions ''similar^' to W with regard to v, i.e., such 
that the chance of rejection of a true hypothesis, P{H e u; | Ho), equals a for all 
the values of v specified by the simple hypotheses composing Ho . A test based 
on a similar region Wq may be called independent of the probabilities & priori, 
if its power with respect to all the alternatives of 12 is greater than that of any 
other similar region Wi of the same size, o:, ( 3 ). Let 

^2=6 log p(H I dv)/dv\e^B(, (XXI) 

Then the equations v>2 = constant will describe hypersurfaces in H-dimensioned 
space, on one of which the observed E must fall. Under certain assumptions 
pertaining to the law of elementary probability it can be shown (2) that a 
necessary and sufficient condition for n? to be a similar region is that 

P[E e w(cp,) 1 Hoi = aP{E € WM I Ho} (XXII) 

for all values of ^2 , where w(ip2) and W{(pz) are parts of the surface <P2 = constant 
common to w and W respectively. A similar region is’ then built up of these 
parts obtaining for the various values of <P2 . The Best Critical Region, 
Wo , for a particular simple alternative, Hi , must then be composed of pieces, 
wo{(p2)j maximizing P{E €Wo{(p2) | Hi]. The problem is the same as for simple 
hypotheses except that we shall be working in a space Wicpz) of (N — 1 ) dimen- 
sions. wq((P2) is defined by the inequality 

p[E I HO > kM v{E I Ho) (XXIII) 

where k{<p2) is some constant depending on a. If Wo{<p2) is the same for all H{ , 
then Wo is the Best Critical Region for testing Ho with respect to 12. 

Heyman and Pearson showed (2) that in testing the composite hypothesis that 
<T = (To when the population mean is unknown there are two Best Critical Regions 
corresponding to the class of alternatives o- < (Tq and <r > o-q , defined respectively 
by the inequalities v' < v[ and R the whole set of alternatives, 12, is to 
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be considered some compromise region must be sought. Dealing with the case 
where similar regions exist Neyman (5) defines a Critical Region as unbiassed 
and of Type B if the first derivative of the power function, P{E ew\ Hi), with 
respect to 8 vanishes at 0 = 6o , and if the second derivative at that point is a 
maximum. Let 


<pi = 


a log p{E I dv) 
dO 


(XXIV) 


Then it can be shov/n that the desired region will be defined by the inequalities 
(Pi ^ ^ 1 (^ 2 ) and <pi > where ki((p 2 ) and are determined to satisfy 


and 



(1 - a)pi(p2) 


(XXV) 


' (pipi(pi<P2) d(pi = (1 — 

^1(^2) 


«) J d<pi 


(XXVI) 


where p(<p 2 ) is the distribution function of <^ 2 , and is the simultaneous 

distribution of <pi and (pz . 

Applying equations (XXV) and (XXVI) it follows that the appropriate 
Critical Region is defined by the inequalities < v[ and v* > v'z where 


and 


a = ai + as 

A 



p(j)') dv' + 



(XXVII) 




= 0 


(XXVIII) 


where p(d') is the distribution function of v'. 

The use of the unbiassed Critical Region of Type B corresponds to adopting 
as a criterion 




(XXIX) 


Since v' derived from a sample of size H is distributed as v derived from a sample 
of size (iV — 1), it follows that r' is equivalent to the r of equation (XIII) based 
on a sample of size {N — 1). Therefore Table I may also be used for testing 
the hypothesis that o- = o-q whatever be the population mean, by entering with 
the number of degrees of freedom, N — 1. 

In the example previously used, compute 

' X = ^ = 0.517 

Prom Figure 1, k is approximately .51, corresponding to P = ,0422. 
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r' is not the same as the maximum likelihood ratio (6). 

X' = 1 ^-NiSy;N/2g-Hv-N) ^ , ,(XXX) 

pmax(i!|cr“m) 

As JV becomes infinite the distribution of V is the same as that of the X of (XVI). 
For N = 49, the probabilities corresponding to X' agree with those using r' to 
within a unit in the third decimal. 

The X' test is biassed as may be seen in Figure 2 where we have plotted the 
power of the test based on the region w defined by v[ = 3.187, ~ 22.912 for 

which a = .0436 + .0064 = .0500, on the assumption that al = 1.0, for N = 10. 
Although the criterion is biassed it is slightly more sensitive to alternatives 



Fig. 2. Comparison of Critical Regions tor v'. Ho Specifies ol = 1.0. iV = 10, 

specifying ^ < al than is the unbiassed Critical Region of Type B defined by 
v[ = 2.953, Vi 20.305, rr = .0339 + ,0161 = .Q500. The criterion of con- 
stant distribution, 

^ (XXXI) 

has also been considered. In this case v'\ = 1.903, v'i = 17.391, a = .0071 + 
,0429 = .0500. This criterion is biassed for some alternatives specifying 
< (To , but its power curve lies above that of the unbiassed region for a-^ > o-l. 
Apparently the bias may be shifted at will by changing the exponent of v'. 
This may be desirable if greater weight is to be given to one class of alternatives. 
In fact decreasing the exponent of v' to 0 produces the Best Critical Region 
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for the class of alternatives specifying > (Xo ^ and defined by Vi = 0, 1/2 = 16.919 
for a = ,0500. No region can be found giving greater power. On the other 
hand this region is insensitive to alternatives of the other class. Increasing the 
exponent indefinitely produces the Best Critical Eegion for the other class 
defined by 2^2 = 00 and^i = 3.325 for « = .0500. 
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ON THE POLYNOMIALS RELATED TO THE DIFFERENTIAL EQUATION 

1 ^ _ go + aix _ N 
y dx bo + hiX -|- D 

By Prank S. Beale 


latroduction. In a previous issue of this Journal,^ E. H. Hildebrandt has 
established the existence of a general system of polynomials P„(fc, x) associated 
with the solutions of Pearson’s Differential Equation 


(R) 


1 ^ ^ 
y dx D’ 


N and D being polynomials in x of degrees not exceeding one and two respectively 
with no factor in common. 

It was 'shown that the polynomials Pnik, x) s Pn themselves satisfy certain 
differential equations and a recurrence relation. The classical polynomials of 
Hermite, Legendre, Laguerre, and Jacobi are special types of Pn{h, x). Since 
the classical polynomials are employed rather extensively in statistical theory, 
certain of their properties are of special interest. 

It is the purpose of this paper to determine from Hildebrandt’s general equa- 
tions some new properties of Pn(k, x) and to apply these properties to the 
classical polynoniials. The paper consists of two parts. In part I some 
theorems are established concerning common zeros of D and P„ . In particular, 
a theorem is established to exhibit the conditions under which the zeros of P„ , 
which are not zeros of D, are simple. , In part 11 a method is outlined for the 
classical polynomials by which one can determine the number and location of 
the real zeros in the various segments into which the zeros of D divide the x axis. 
The points of inflexion and the degree of the polynomials are also considered. 

A new feature of the method employed is, we believe, its being based upon the 
use of differential equations of first order, for most part, while other investi- 
gators^ have employed differential equations of second order. As to the results 
obtained, the author believes them to be partly new. They have points in 
common with the results of Fujiwara, Lawton and Webster, 

‘ Systems of Polynomials Connected with the Charlier Expansions, etc., Annals of Math, 
Stat., Vol. II, 1931, pp, 379-439. 

’ M. Fujiwara: On the zeros of Jacobi’s Polynomials, Japanese Journal of Math., Vol. 2, . 
1926, pp. 1, 2. 

. W. Lawton: On the zeros of Certain Polynomials Related to Jacobi and Laguerre Poly- 
nomials, Bull. Am. Math. Soc., Vol. 38, 1932, pp. 442-449. 

M. S. Webster: Thesis, Univ. of Penna, These results were kindly communicated to 
me by Dr. Webster. 
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I. Theorems Concerning Common Zeros of P„(/c, x) and D 

The following equations will be employed later; 

( 1 ) Pr^+iik, x) =[N + {k- n)D']P^{k, x) + DPUk x). 

(2) P'n+iik, x) = in + 1 ) N' + D"] P„(fc, x). 

Pn+i(fc, x) = [N+ik- n)D']P.(k, x) 

P"] DPUk, x). 

These are not explicitly given in Hildebrandt's Paper but the method of obtain- 
ing them is outlined there in detail. 

We shall make use of the following lemma which we state without proof. 

Lemma (1) . Let Pn{x) he a polynomial of degree n. If both Pn and contain a 
factor (x — a)*”, m < n, then Pn contains the factor (x — 

We also need an expression for Pl^li(kj x). By repeatedly differentiating (2) 
and eliminating Pnik^ x) we get, 

PS.tt, rt) . n (» + 1 - ijljV' + ” + ,) 

(4) ■- L 2 J 

^ = 1, 2, • . • (n + 1)^ 

Theorem h , If D is a perfect square^ D' is not a factor of Pn+i (/c, x)y n = 

0 , 1 , 2 , . . . 

Proof: Assume D' to be a factor of Pn+i . From ( 1 ), D' is either a factor of 
Pn or of W + (fc — n) D'. But D' is not a factor of JV + (fc — n) D' as this 
implies that D' is a factor of N contrary to hypothesis on (R) that D and N 
have no factor in common. Thus, D' is a factor of Pn , and by a repetition of the 
reasoning a factor finally of Pi , which as it was just pointed out, is impossible. 

Theorem I^, Set D = (uix + Pi){a 2 X + ) 32 ), i) not a perfect square. If 
oLiX + /3t , i == 1 or 2 , is a factor of Pn , then {aiX + /3i)® is a factor of Pn+g-i , 
^ 2, 3, . . . 

Proof: From (1), aiX + being a factor of Pn and i), is also a factor of 
Pn+i . From ( 2 ), aiX + /?* is a factor of Pn+i . From Lemma ( 1 ) it follows 
that {aiX + PiY is a factor of Pn+i . Continued repetition of the reasoning 
establishes the theorem. 

Corollary. If both cniX + and a^x + ^2 are factors of Pn , then D® is a factor 
of P n+ 3— 1 • 

Theorem J 3 . Assume D of the same form as in Theorem I 2 . If aa + pi , 
i = 1 or 2 , is a factor of Pn+i and no higher power of oax + Pi is such a factor then 
cLxX + Pi is a factor of N (k — n)D^. 

Proof: From (1), oax + Pi being a factor of Pn+i and of D is also a factor of 
either W + (fc — n)D' or of Pn . But aiX + pi a factor of Pn requires, from U , 
that {aiX + piY be a factor of Pn+i contrary to hypothesis. Thus, ,aiX + pi is a 
factor of W + (k ^ n)D\ 



208 


FRANK S. BEALE 


Corollary, If (aiX + ^i){ol2X + ^32), (<xi , a2 ^ 0), is a factor of P n+i and no 
higher 'power of either aix + ft or aiX + ft is contained in P n+i then N {k ~ n) 
D' = 0. For from Ig , N + (k - n)D' contains {ocix + Pi){aix + ft) as a factor 
which implies N + {k — n)D', being linear, vanishes identically. 

Theorem h . If iaiX + (3;)® and no higher power of onx + is a factor of 
P„+5_i then aiX + ft and no higher power of aiX + is a factor of P„ . 

Proof: Let us write, 

(A) Pn+t-i == {diX + /Si)® 4>r^-i , </>n-i - a polynomial of degree < n - 1 which 
does not contain the factor <x{X + /?,■ . Taking the (q — 1)“*' derivative of (A) 
by Leibnitz Theorem, we get. 


(B) 


P 


(9-1) 

n+g-1 



iuiX + 






<hn~l - 


On setting g = 5 — 1 in (4) there results. 


(c) p^T,-! = n (w + g _ 1 _ z) 


N' + 


2h 


q + i + 2 


D" Pn. 


From (B) we see that aiX + /3t is a factor of PnVo-i* No higher power of 
cLiX + /3i is such a factor. From (C) our theorem now follows. 

Corollary (1). Under the hypotheses of Theorem I4 , oiiX + is a factor of 
+ (fc — n + 1)P'. This follows at once from I/l and h « 

Corollary (2). If P® = (aiX + ft)® (a^x + ft)®, (ai , (X2 ^ 0), is a factor of 
Pn-fc-i and no higher powers of either axx + ft or ol^x + ft are factors j then N + 
(fc — n + 1)P' = 0. For the linear expression iV + (fc — n + 1)P' contains, 
from Corollary' (1), the quadratic factor {aix + ft) {a^x + ft). 

The following lemma can be easily established and is given without proof. 
Lemma (2). Assume D of the same form as in Theorem I2 . Then there is only 
one value of s for which N + sP' contains aiX + cts a factor. 

Theorem 1 5 * Assume P of the same form as in Theorem J2 , // iV' + (A — n)D' 
contains aiX + /3x , ^ = 1 or 2, as a factor, then Pn+i contains aiX + /3i and no 
higher power of UiX + j3i as a factor. 

Proof: From (1) we see that Pn^i contains oiX + jSi at least to the first power 
a factor. Again from (1), if Pn+i contains a higher power of aiX + as a 
factor, this means that both Pn and P^ contain aiX + at least to the first 
power as a factor and from Lemma (1) it follows that Pn contains aiX + at 
least to the second power as a factor. By corollary (1) from Theorem 74 it 
follows that OiiX + /3i is a factor of 77 + (A — ni)D' for ni < n, contrary to Lemma 
(2). 

Theorem It . If aix -f* ft and a^x + ft are factors of N A- (k — wi)P' and 
N + (A — n2)P' respectively, (ai ,oi2 7^ 0), then P^ = 0 , /a > ni + ^ . 

Proof: From Theorems It and I2 we see that {aix + ft)”^ (0:22: + ft)"S of 
degree Rx + , is a factor of Pnj+ni ? of degree + Ui at most. Similarly, 
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(ail + ft)”"'*'' (asS + ft)”'"*'', of degree nj + iii + 2, is a factor of Pnj+ni+i , of 
degree ria + ni + 1 at most. This implies s 0. Hence, P„ s 0, 

iji> ni + rii. In fact, (1) shows that P,, ^ 0 implies P„ = 0, )/ > /i. 

Theorem It . Assume D of the same form as in Theorem h . Then P„ 4 i = 0, 
P„ ^ 0, implies either JV + (fc - m)D ' s 0, ot < n, or there exist two values of 
m, {mi , wijs), such that JV (fc — mf)!)' , N {k ~ mz)D' contain as factors 
aix + /3i and a^x + ft respectively, {mi,nh<n). 

Proof: Setting P„+i s 0 in (1) gives, 

(1") [N + (fc - n)D'] P„ + DP'„ ^ 0. 

If Pn = const., 1° shows that + (fc — n)D' = 0 and our theorem is verified. 
Suppose Pn ^ const. We get from (l”), 

p, _ [N A- {k- n)D']Pn 
” ^ • 

Thus, D is a factor of the numerator, and our theorem now follows from Corolla- 
ries (1) and (2) of Theorem h . 

Theorem Jg • J/ iV + (ib - m)D' ^ 0, m - 1, 2, • • • n, and f/ iV + (fc — m)D' 
contains neither aiX + ft , nor a^x + ft as /actors, then PnA-i and D have no factors 
in common. This follows at once from Theorems h and h which constitute a 
necessary and sufficient condition that P„ and D have factors in common. 

Theorem Iq . If N ^ const, and if D is linear, all Pn are constants^ n = 1, 2, 3, 

• • • . This follows directly from (2). 

Theorem Zio . If N' + 7 ^ 0, m = 1, 2, • - • (n — 1), all zeros of Pn 

A 

which are not zeros of D are simple. 

Proof: Suppose Pn has a multiple zero x - a which is not a zero of D, Then 
(1) shows, that a is a zero of Pn 4 i . From (2), a is a zero of Pn+i. From. 
Lemma (1), a is at least a double zero of Pn+i . Furthermore, (3) shows that a 
being a double zero of Pn and of Pn+i is also a double zero of Pn-i . By a con- 
tinued application of (3), it follows that a is a double zero of Pi which is impos- 
sible since Pi is of degree <1. 

II. Concerning the Zeros of P^ik, x) 

The polynomials P„(fc, x) are defined by Hildebrandt^ as follows; Pn(lc, x) = 

„ — D^y xvhere y is a non-identically vanishing solution of the differential 

y dx^ 

equation 

I dy _ gp + aix ^ ^ 
y dx 6o -j- bix + hx^ D' 


3 L.c. pp. 400-401. 
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The Jacobi Polynomials are defined as follows: 


real. It follows that Jn{x, a, /3) is a special type of P„(lc, x) with N s {-P-oi) 
X + a, D = x{l -- x),n k + I, whence, 

N’ = -/J-a, £>' = !- 2a:, D" = -2; D(0) = D(l) = 0, 


X = 


PiQc, x) s N + kD' = 0 for 
“ + ‘ ?;»,«,) = -/3 - a 


Q! -j- |3 


2k 


In determining the number and location of the real zeros of the Jacobi Poly- 
nomials we employ the following notations : ^ 


Pi{k, x) = Oioxx i = Ij 2, 


fc+l;fc=0, 1,2, ...;j = l,2, ...i. 




e = N' + ^ -0- a -2k + n, n = 1, 2, . . ■ fc, 

A 

ju = [iV^ + (fc — = a + (fc — • n), 

. = [iV + (/c - n) D%^i = n). 

We proceed to determine the number of real zeros of the Jacobi Polynomials 
on the intervals (~ oo, 0), (0, 1), (1, oo) into which the zeros of D divide the 
X axis.*^ The proofs proceed by mathematical induction. We first determine 
the location of the real zeros of Pn(kj a;), n = 1, 2, • • • A; + 1, by successive 
applications of (1) and (2). We then use the relation Pjb+i (fc, ^ Ja+i P)- 
Several cases concerning possible values of a and ^ should be considered. In 
order to bring out the method of procedure only two such cases will be fully 
discussed here. The results for other possible cases will be merely listed. 

Ai : a < 0, /9 < 0, [ a I < \ ^ \ a ’j- ^ not integers. 

Let hi be the greatest integer contained in a:, 

n 7 a ff n n (i n Q 

^2 fjj 

’h be the greatest integral value of k for which a + 0 + 2k <0. Then 


0 < h < ks < k2 • 


* In the case «, /? > 0 these zeros all lie, as is known, ov (0, 1). 
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All : 0 < A: < fcj . 
Then Jk+i{^, P) has 


We then have 5 > 0, ju < 0, v > 0, 0 < «ui < 1, Pl > 0. 
( 1 )*' + (_!)'' 

2 in 0, 1. These are the only real zeros. 


Proof: Consider first Pi{k, x). Its only zero is at aj,t,i , where 0 < ai,*,,! < 1. 
Furthermore, P[ > 0. Also Pi > 0 for a: > ai,*,,! and < 0 for a: < ai,*,i . From 
(1) we see that PiQc, cii,k,i)> 0, (since Pi(fc, Q:i,fc,i) = 0, D{ai,ka) > 0 and P[ > 0). 
From (2) it follows that P't{k, x) < 0 for a: < «i,fc,i , P((/c, xi) = 0, Pi{k, a;) > 0 
for X > q:i, 4 ,i . These conclusions follow from remarks concerning the sign of 6, 
the fact that Piik, o:i,*,i) = 0; and from remarks concerning the sign of Pi to the 
left and to the right of x = on, k,i- Thus, P^Oc, z) > 0 for all real z and hence 
has no real zeros. By employing (2), it is now evident that P8(fc, x) > 0. From 
(1) and remarks concerning y and v we see that P 3 (fc, 0) < 0 and Pa{k, 1) > 0. 
Thus Piik, z) has a single real zero D!a,*.i , 0 < « 3 , 4 ,i < 1. The reasoning from 
Pi to Pi K analogous to that from Pi to Ps . By continuing this procfedure we 
finally conclude that Pk+iik, x), {= Jh+i (x, a, fi), has but one real zero, (in 0, 1), 
if k is even and no real zeros if A: is odd. 

Aia: h < k < ks . Set k — h ~\- q, q = 1, 2, • ■ • ,kz — ki . Here d > 0, 
y> 0,n = 1,2, ■ ■ • q - 1, y <0,n = q,q + 1, ■ ,q + ki. v > 0, ai,k,i < 0, 

P'lik, x) > 0. + ? + 1 (x, a, (3) has q distinct zeros in (— «, 0) and 

-L 

— — zeros in 0, 1. These are the only real zeros. 

A 

Proof: First consider the sequence Pn(fc;.at) n = 1, 2, • • • since the conditions 
on 0, pt; aiid V do not change over this range of n. Now cLi,k,^ = 0, ai.ii.i < 
0. Furthermore since Pi > 0 we have Pi > 0 for a; > ai,fc,i and < 0 for a: < 
ai.fc.i . Pass now to x). Since D{ai,k,i) < 0 Pi {kj ai^k.i) > 0, we see 
from (1) that P 2 (kj ai.jb j) < 0. Moreover (2) shows Pi (fc, ai.^.i) = Oj Pi {h x) 
< 0 fora: < ai.fc.i and > 0 for a; > . Thus P 2 (ft, x) < 0 and a relative 

minimum at a; = Since | P 2 (?:, ± «>) | qo, we see that P^{kj x) has two 

real zeros of which the left most, , is in ( — 0). Again ^ > 0 together 

with (1) assures Pzik, 0) > 0. Thus a 2 .A :.2 is in , 0), hence in (— qq, 0), 
By continuing this reasoning on the successive Pn(fc, x), n = 1, 2, * • • q, we 
conclude that Pq(ky x) has q zeros in — <», 0 and P^Cfc, ag.fc.i) < 0- 

Next, consider the sequence Pn{k, a:),n==^ + l,^ + 2, + 

Over this range of n we have ^ > 0, < 0, j' > 0. From what has just been 

shown, Pg(fc, oiq^k.i) = 0, — oo < aqxi < 0, f = 1, 2, • • • (?. Also Pg(fc, 
f = 1, 2, • • • g, is alternately negative and positive. Suppose q odd, (similar 
reasoning holds for q even). Thus, we suppose Pq{kj oiq,k,i) < 0, Pg(/c, oiq,je,q) < 
0, Pq{k, x) > OioT X < aff,*.i and < 0 for a: > (^^,k,q . (1) shows Pq^i{kj ocq^k,i)f 
i = 1, 2, • • » g, to be alternately positive and negative. Thus, the zeros ag, k^i 
are separated by g — 1 zeros of Pg+iCfc, x). Since from (1), Pg+iCfc, > 0 
and from (2) Pg+i(fc, x) > 0 for a: < oiq.kii , there exists a zero in (- 

Thus far, we have established the existence of g zeros of Pg+i(/s, x) in 
{— °o, 0). g being odd, we have from (1), Pg+i(^J, > 0* from (2), 
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PUi(fc, x) < 0 for X > aq,k,Q , Again from (1) and assumptions regarding m and 
V it follows that Pq-\-i{kj 0) > 0, Pq-^i{kj 1) < 0. Thus, x) has a zero 

«g+i.*.5+i 1)- There being no extrema for Pq+i{k, x) other than the , 

i = 1, 2, • • • g, (as (2) shows), we have thus proved that Pg+iC/c, x) has q 
distinct zeros in (- oo, 0) and a single zero in (0, 1). Eeasoning similarly from 
to Pg^ 2 {k, x) we establish the existence of q distinct zeros aff+s.jb,,-, 
i = 1, 2, • ■ • g, in (-^ 00, 0) with a^+ 2 ,k,i in (- oo, ag+i,*,!) and <xg^ 2 ,k,ij i = 
2, 3, ■ ■ ■ 3, separating ^ = 1; 2, • • • From (1) we see that Pq-^^ik^ 

"m.fc.c) < 0 and Pg+2(fc, a^+i.^.g+i) < 0. The only extrema of Pq^ik^ x), 
(as (2) shows), are located at , i == T, 2, * ■ • g + 1. Again, by (2), 

PU^{k^ < 0 for X > a:g+i,jfc,g+i ; hence there can be no real zeros of except 
the q zeros in oo, 0) already found. The reasoning from Pg ^2 to Pq+3 is 
similar to that from Pq to Pq^i . Thus, Pg+^i+i ^ has q distinct zeros in 

(— 00, 0)^ together with one zero in (0, l) for ki even. For ki odd, there are q 
distinct zeros in (— 00, 0) only. The results are the same whether q is odd or 
even. 

The results for the remaining sub-cases under case Ai are given in the table 
which follows. For completeness, the results for cases An and A12 are included 
in the tabulation. A few words of explanation are necessary to clarify the 
conditions under which the various sub-cases in the table occur. Let 1 a | = 
ki 'p qj \ P \ ka hf hf q < 1. If q -p h < Ij then \ a -p p\ = fci -f- ^2 and we 
have either, 

Ai3i : ki'pkz even, 211;3 = fci + ^ Jfca — fci = fe — ^3 . 

Ai32 * ki k^ odd^ 2^3 ^ hi -p ki — 1 — ^3 — /ci — Aia — hz 1. 

Again if 1 < g + /i < 2, then la + )3l = fci + fc2 + l and we have either, 

Ai 33 i )ci -}- fcg 1 2ks = ki -p kz -p 1 ^ ks — hi ^ ki kz -p 1. 

Ai 84 I fci "p H“ 1 oddj 2kz ^ ki -p ki ^ kz ki ^ ki ^ kz 
In cases Am and Aibi we assume |a + ^l = fti + fc2 + p, p<l, while in cases 
Ai4a and km , 1 a + /? | = fci + fca -f p, 1 < p < 2. The complete results for 
case Ai follow. (See page 213.) 

Ag : a < 0, < 0, 1 o! I < I /9 I , a, /3 Tzo^ integers^ a + == integer. Define ki , 

, A3 as in Ai . Then 0 < A: < A3 < . In Case A2i , + « is odd while in 

Case A22 , ^ « is even. (See page 214.) 

As : a < fi, j3 < 0, od - — Ai , integer, ^ not an integer, \ a \ < | ^ j . Define 

Ai, A2, Aa in Ai . Then 0 < Ai < As < A2 ! There are two sub-cases, A31 : the 

greatest integral value of a: + is odd, A32 : this integral value is even, (See 
page 215.) 

A4 : a < 0, ^ < 0, a no< an integer, ^ — Aj, integer, | a | < | ^ | . Define 
Ai , A2 , A3 as in Ai . Then 0 < Ai < A3 < Ajj . There are two sub-cases, A41 : 
the integral part of « 4* ^ is odd, A42 : this integral value is even. (See page 216). 
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415, A42BJ Jki+hi+^l] 5 = 1, 2, 3, •••• ■ + (-1)^1 
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differential equation 
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As : a < 0, /3 < 0, 1 a I < 1 13 I , „ ^ 
ki, h , h ^ in Ai . In cases Aji and A 


ki integer, d = -fej integer. Define 
62 ) “ + 3 is odd and even respectively. 


Cases 

Polynomial 

Range of Sub-Script 

_ Zeros in 





(-MfO) 1 = 0 

(0, 1) 

Asll) Ab 21 ; 

Jk+lf 

0 < fc < fci; 

0; 0; 

(D* + (-1)* 

o J 

A 512 , Abm; 

t/Ai-fg+i; 

g ~ Of 1; 2 , • ‘ *7 ^3 — fell 

9 : h + 1 ; 

0 

Abw; 

*3+34-1 J 

? = 1, 2, fca- fca - 1; 

h-ki- g; hi + 1; 

0 

A623; 

J *3+34-1J 

? ~ if 2j • ■ ‘f ^2 — fcs — 1; 

^3 — fci — j + 1; + 1; 

0 

Abw, Ab24J 

J *24-34-1 — ^ J 

6Cl 

II 

H-i 

JsO 

ft-- 



A 51 B, Abzb; 

«/*i 4'*2+S+1 “ t)j 

g = 1, 2, 3, • . • 




If assumptions are identical with those of As except | « | = | | , then for 

0 <k <ki, the results agree with Asu and = 0, g = 0, 1, 2, . ’ . . 

Ai-. a> 0,fi <(),\a\> \p\,linotan integer. Let h be the largest integer 


Case 

Polynoinial 

Range of Sub-Script 





(0, 1) 

Aei 

/a-I-I 

0<k <ki 

0 

Asa 

t/fci+g+I 

Q = L 2, 3, • • ■ 

Q 


Zeros in 
( 1 , “) 


2 

+ (-!)''■ 
2 


A 7 : Same assumptions as in Ab except ^ — ^ki, integer. 


Case 

Polynomial 

Range of Sub-Script 


Zeros in 





(0, 1) 

a) = 1 ( 1 , oo) 

An 

/fc4.1 

0:^k<ki 

- 1 

0 

0 (l)*+(-l)‘ 

2 

A 72 

J *1-1-34-1 

g = 0, 1, 2, 


3 

/ci -j- 1 0 

Aa : oi 

V 

A 

0, 1 a 1 = 1 ^ 1 

. Ji 

= a and results for , n > 1 are 


identical with those in A? and Ae respectively according as j(3 is or is not an integer. 
Afl : a > 0, i9 < 0^ \ a \ < \ ^ \ ; ^^ a + p, not integers, 

Let ki be the greatest integer in a + /3. 


(( 1^^ n it ft {< it p 

ki ** “ “ “ for which a + /S + 2 fc < 0 . 
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Then 0 < h < ki '< h ^ 


Case 

polynomial 

Range of Sub-Script 

Zeros in 



(-“>,0) 

(0, 1) 

(1, «) 

Aoi; 


0 < k < h; h 

0; 

0 

Aqxij 


g = 1, 2, fca; fcieven; fcg - g + 1; 

0; 

0 

Agaa) 

^ ^ 3 +b+Ji 

g = 1, 2, • • •, (fca + 1); h odd; fcs ~ g + 2; 

0; 

1 

Afts; 

J Ai+g+lJ 

g « 1, 2, •••, (*a - fci); 0; 

0; 

2 

A94; 

^ fcl+C+lJ 

g = 1 , 2 , 3 , • • • ; 0 ; 

g; 

+ (-1)*’^ 

2 


Alo : Same asmmptions as in hut now | a | = | 1 . Then hi = h = 0, 

Ji = a, and results for Jn j ^ > 1 are the same as in Ads and Au . 

All : Same assumptions as in Ag except j3 = — fe , integer. 

Case Polynomial Range of Sub-Sorijit Zeros in 

(- 00 , 0) (0, 1) x^l (1, oo) 

All,! Sameas Agi 
All , 2 Same as Am 
A u,s Same as Agg 

All, 4 2^ 3j ■ ' * > Oj hi “4^ Ij 0 

Ai 2 : « > 0, /S < 0, 1 a 1 < 1 iS I , /9 no< aw integer. <x 0 ^ odd integer. 
Define ^kz^hminAg. 

Ai8 ; Same assumptions as in A except a 4* iS = even integer. 


Cssefl Polynomial Range of Sub-Soript Zeros in 


Ai2,ii Aisa ; 

Same as An 

0) 

Ai2,2 ; 1 

*^ti+s+i) 9 “ 1, 2, • 

[/«,+! =s const. > 0; 

• • , fca; hs ^ Q i 

j 

Aij.s; ‘ i 

[•^*,+«+i5 9 ~ 1, 2, • 

[j%k,+» = const. > 0; 

' - , fca + 1 ; fta — ^ + 2 

Au,g> Aia.a; 

Same as An 


Aia.i, Ai8,4 *, 

Same as Am 
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Ai 4 : Sa7ne assumptions as in Au , except integer. Ca^es Ah^ , 

Aid ,2 Q'lid Ai 4,3 have the same results as Ai 2 ,i > Ai2,2 > and Ai2,3 respectively. 
Ai 4,4 has the same results as An ,4 , 

Ai 6 : Same assumptions as A13 except ^8 = -kz , integer. Cases Aua , A^.a , 
and Ai 6,3 have the same results as Aig,i , Aig,2 , and Aig.g respectively. Ai^,4 has 
the same results as An, 4 . 

Ai 6 :a=0, /3<0,j3 — not an integer. 

Let ki be the largest integer contained in /3. 

“ kz be the largest integer for which jS + 2A < 0. 


Case 

Polynomial 

Range of Sub-Script 

Zeros in 





(— 00, 0) 1 = 0 (0, 1) 

(1, 00) 

Aie.i; 


0 < k < k^\ 

k-, 1; 0; 

0 

Ai6,2; 

t/As+a+i; 

, „ . , 1 fc 3 -g; 1; 0; 

g = 1, 2, • • • , fci - *3; < 

Us-g + l; 1 ; O; 

0; ki even 

1; fci odd 

Aie.a; 


= 1,2, 3, 

0; 1; q^l; 

(1)‘> + (-!)*=> 
2 

An 

; a = 0, ^ 

= — Aji — odd integer. 

Define h as in An . 


Ai8 

: a = 0, ^ 

= — fci — emn integer. 

Define h as in An . 





Cafies 

Polynomial 

Range of Sub-Script 

Zeros in 





o 

II 

o 

(0, 1) 

a; = 1 

Ai 7 , 1 , Ai8,i; 

Same as An.i 





Ai 7 , 2 ; 

J * 3 + 5 + 1 J ' 

g = 1, 2, ■ , fci - fcg - 1; 

kz g j 1 , 

0; 

0 

Ai8,2; 

•A * 8 + 5 + 1 J 

g == 1, 2, • ■ ■ fcs -h 1; 

^ 3 -^ + 1; 1; 

0; 

0 

Ai 7 . 3 , Aia.g; 

J/*i+l = 0 



g - 1; 

Aji + 1 

u*i+ 5 +i; 

g = 1, 2, 3, • • - ; 

0; 1; 

Ai 9 : a 

= 0, (3 = 0. 

Ji = 0. 




Jk+1 has A; — 1 zeros in (0, 1), 1 zero at x = 

0, 1 zero at X = 

1, fc = 

1, 2, 3, 


From the definition of Jn(a:, a, p) it is readily seen that Jn(Xj a, $) « (—1)'* 
J„(l — Xj /3, a). Thus, a transformation of cc to 1 — x interchanges a and p. 
The interval (— 00 , 0) is transformed into (1, 00) and vice-versa. The points 
X = 0 and x = 1 are interchanged. Consequently, in all previous results we 
may interchange properly cx and /3. 

In the foregoing results, the only real multiple zeros that can occur are at 
either x = 0 or x = 1. In the process of determining the degree of multiplicity 
of such zeros use was made of Theorem I 2 . 

Points of Inflexion. By taking (4), setting k ^ and replacing iV' and D'* 
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by their values for Jacobi polynomials, we get: PnVi(^; x) = {n + 1) (n) 
[fi + oi + n] [^ + a + n + 1] Pn^iiriy x). From definitions of P„(Aj, x) and 
Jn{Xf a, /3) we easily verify that, 

Pn(n ±qyX) ^ /n(x, a ± q + ly ± q + l)y whence, 

Jn{X) oiy |8) i= (n + 1) (n) [j^ + a + ri] [/3 + a + + 1] /n-i (x, a + 2, j8 + 2). 


We conclude that if neither a + P + n nor a + /? + ^ + 1 yanishes, the points 
of inflexion of a, (3) are at the zeros of odd order of t/'n-i(x, « + 2, /3 + 2), 

The Degree of Jn{Xy a, /3). In analyzing the results of cases Ai to Ajg inclusive, 
it is noted that in some cases the number of real zeros of is less than n. The 
question naturally arises whether the degree of /„ is n or less, for then we can 
determine the number of its imaginary zeros. The explicit expression of 
Jn(xy a, /3) is known from which the degree of can be found for various a and 
p. However, the degree of Jn can be found from (4). 

Since J^+iix, a, P) = Pn+i(^^, x), let us replace A by n in (4) and at the same 
time replace A' and by their values for Jacobi Polynomials. Thus, we get: 


a, $) = - ■i]P„_ 3 +i(n, x), 

(5) 

n = 0, 1, 2, ■ • • ; g- == 0, 1, • • • , (n -t- 1). 


W e may establish the following results. 

Cl) If a + j3 is not &n. integer, the degree of (x, a, /3) is n -f !> Ti = 0, 

1 , 2 ,.... 

In fact, in order for Jnji to vanish, we see from (5) that either some factor 
— |3 — a — n — i vanishes or P„_g+i(n, x) vanishes identically. We first show 
4)hat the latter is not possible. Now Pi(n, x) = A + nD' = (— /3 — a — 2n) 
:x + a + n ^ 0 since + a is not an integer. Consequently, if Pfi(n, x) = 0, 
)u>0, /x<n + l there will be a first value of /x, = >), for which P,.(n, x) ^ 0 

but Py^iirty x) ^ 0, By virtue of Theorem h this means that either A + 
(n — p)I)' = [— ~ a: — 2(n — p)] x + a + n — p = 0, p < v, or else there 

exist two values of p, (pi , P2), such 'that [— jS — a — 2(n— Pi)],x + a + n — pi 
and [- — a — 2(n — P2)] x + a + n — p2 are divisible by x and 1 — x 

respectively, Pi , P2 < v — 1, pi 7*^ P2 . Since, however, a + i9 is not an integer 
we see that, [— iS - a — 2(n — p)] x + a + ?i — p ^ 0, n and p being integers. 
This eliminates the first possibility that P;j(n, x) = 0, ju < n + 1. Again, if, 
[— /3 — 0! — 2(n — pi)] X + n — pi is divisible by x, we have a + n — pi = 
0 or od an integer. For (a + n — pg) — + a + 2{n — ps)] x = (a -f' n — p^) 

r 1 ~ ^ ^ pO + {p + n — ^ 1 divisible by 1 — x requires fi + n — 

L {ot + n- P2) J 

P2 = 0 or jS, an integer, a and /3 are therefore both integers contrary to hypoth- 
esis. Thus, in (5), no polynomial P„-q+i(fc, x) = 0 and /n+iC^J, oty /3) ^ 0. 
Replacing g by n + 1 in (5) leads to, 
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(6) ^ — a - n-i] Po(n, x), 

B = 0, 1, 2, . . . . 

Thus ^ 0, (since Po(n, a:) = 1 and no factor - ^- a- n — icm vanish) 
and the degree of Jn+i is precisely n + 1. From similar reasoning we prove: 
C 2 ) If a + jS > 0 the degree of is n + 1, m = 0, 1, 2, ■ ■ • . 

Cs) If a + /3 = 0, then (I) Ji = a and (II) is of degree n + 1, n = 1, 
2, 3, . ■ ■ 

Cl) If a + /3 = -M - integer, M > 0, (3, a not integers, then, 

(I) For n < M, the degree of J„^i is min. (n + 1, Jlf - n). 

(II) n = M, /„+! = const. 

(Ill) n > M, the degree of J„^i is w + 1. 

Cs) If a + 0 = - M - integer, M > 0, a, ^ integers, a > 0, ^3 < 0, then, 

(I) For n < M, the degree of J„+i is min. {n 1, M — n). 

(II) n = M, Jn+i = const. 

(Ill) n > M, the degree of J^+i is n + 1. 

0«) If a + ;8 = — M - integer, iff > 0, a = - /ci-integer, ^ = -fcj-integer, 

ki < ki then, 

(I) For n <ki, is of degree n + 1. 

(II) n>ki, Jn+i s 0. 

C 7 ) If a + /3 = — M — integer, Af > 0, a = ^ = — fci-integer, then, 

(I) For n <k\, is of degree n + 1, 

(II) n > fci ,/„+! = 0. 

The Laguerre Polynomials. These are defined as follows: 

L„ ^ Z,„ (x, a) = ti = 0, 1, 2, • • ■ ; 

a — real. We see that L„ is a special case of Pn(fc, x) with N = — k a, 
D s X, n = k + 1. It follows that 6 = —1, /j, = a + k — n, am = a + 
and Pi{k, x) = 1., These can be used in determining the location of the real 
zeros of Ln , as was done for J„ . The discussion here is somewhat simplified 
since Ln has but one parameter, a, and the a:-axis is divided by the zeros of Dix) 
into, two segments only, namely, (— «>, 0) and (0, «). 

The following results are easily obtained. 

Bi : a > 0, Ln(jc, a) has n distinct zeros in (0, 00 ), n = 1, 2, 3, ■ • • . This 
result is well known. 

B 2 :‘ od = 0. L„+i(a:, a) has n distinct zeros in (0, 00 ) and a simple zero at x = 0, 

Ti = 0, 1, 2, . • • . 

Ba : a < 0, a, not an integer. Let h be the largest integer contained in a. 

(I) Lj,+i(x, a) has zeros in (- «, 0), 0 < k < h, 
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, (l)*l + (-1)*' 

(II) Lki+g+i{x, a) has q distinct zeros in (0, ■») and zeros in 

(_ co,0),g = 0,l,2, ... . 

Bi : a < 0, a = —h - integer. 

(I) Lk+iix, a) has — — — zeros in (- w, 0), 0 < fc < * 1 . 

(II) Lki+g+i{x, a) has q distinct zeros in (0, “ ) and a zero of order fci + 1 at 
a: = 0, g = 0, 1, 2, . . . . 

The Degree of L„{x, a). We show first that here Pn{n, «) 0, ^ = 1, 2, . . ■ 
n + 1. By definition, Pi(n, x) ^ N + nD' = —x + a + n ^ 0. Let us 
rewrite (2) for our present situation thus : 

(2®) ?'(n, x) - -ixP^-iin, x). If, now, P^{n, x) = 0, then from (2®) it follows 
that P„_i(n, x) s 0. Continuing this reasoning, we finally arrive at a contrar 
diction, namely, Pi{n, x) s 0. If in (4) we set g = n + 1 and replace N' and D" 
by their values we get: 

«) = + 1)! Po(«, X) = + 1)1 

Hence, is of degree n + 1. Note that this holds regardless of the value of 
ct contrary to what was found for Jacobi Polynomials. 

Points of Inflexion. By a procedure analogous to that used for Jacobi Poly- 
nomials we can show that the points of inflexion of a) are located at the 

zeros of odd order of Ln-i(a;, a + 2). 

The Polynomials Pn(0, x). If we set fc = 0 in (1), (2), and (3) we obtain the 
following relationships for Pn(0, x)^ = Pn(x) = Pn . 

(7) Pnti(x) nD^] Pn(x) + DP^x). 

(8) PU,{x) = (n + 1) [iV' - P„(x). 

(9) P„+:(x) =.[N~ nD'] P„(x) + n(N' - ^ D") I>P„-i(x). 

Theorems Ix to Zio inclusive, with fe = 0, hold for Pn(x). In addition, the 
following theorems hold for Pn ■ 

TheoTem Hi. Suppose N linear and D{x) > 0 for all x. Furthermore^ let 
m 

N' — P" < 0, w == 1, 2, 3, ' « • . Then Pn has n reaJ, distinct zeros which 
separate the zeros of Pn+i . 

Proof: Denote the zeros of P„ by anj , f - 1, 2, • • . n, a„,,- < an.t+i . Suppose 
N' > 0. N being linear has a single zero an . Furthermore, since Pi ^ 
then Pi < 0 for rc < an and > 0 for x > an • We pass now to Pa . From (7), 
we see that Pa (an) > 0, (since P > 0 and Pi > 0). Also (8) shows PaCx) > 0 


® E. H. Hildebrandt, loc. cit. pp. 399. 
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for X o!ii and 0 for x !> an . This follows from whst wfls Doted concerning 

the sign of Pi for x > au and x < an , together with the hypothesis that F' - - 

2 

D" < 0. Thus, there exists a zero of Pj in (- «, an) and a zero in (an , oo) 
and our theorem holds for n = 1. Assume that the theorem is true for n = h. 
The sequence Ph{oih,i), ■<■ = 1, 2, • • • /i, is alternately positive and negative. 
Since, from (8), the only extrema of P^+i are at a^,.' , f = 1, 2, • • ■ fi, we conclude 
that there axe h — 1 zeros of P;.+i separating the a^.^ , f = 1, 2, • ■ ■ A. Since 
P'hicih.i) > 0 we conclude that P^ < 0 for x < %,i . This fact, combined with 
(8), shows P'h+i(x;) > 0 for X < . Ph+i{aK,i) being positive, it follows that 

there exists a zero of P^+i in (- oo, a^n). Similar reasoning establishes the 
existence of a zero of P^+i in (a/,,^ , oo). Our theorem is thus established for 
N' > Q. The case iV' < 0 can be similarly treated. 

Theorem Hi : If D{x) > 0/or all x, P" < 0, iV' — ^ D" < 0, iV' = 0, iV 0, 

then Pn , n = 2, 3, • • • , has n ~ 1 real, distinct zeros which are separated hy the 
xeros of P„_i . 

Proof: Since Pi^ N = const., we see from (7) that Pi is linear. The reason- 
ing of Theorem Hi applies where we now start with Pi . 

Theorem Ht : If D(x) > Ofor all x, except x = jS, where D has a doulle zero and 

if N' Q, N' < 0, n = 1, 2, 3, • • • , then Pn fias n real, distinct zeros 

which separate those of P„+i . 

Proof: Theorem 7i with h = 0 assures us that P„ and D have no zeros in 
common. The proof now follows the line of reasoning of Theorem Hi . 

Theorem Hi : If D{x) >0 for all x except x = /3 where D has a douUe zero and 

if N' = Q, N ^ Q, N' - ^ D” < 0, m - 1, 2, 3, ■ • • , then P„ has n — 1 real, ■ 

distinct zeros which sepo/rdte those of jri = 1, 2, 3, • • • . This theorem follows 
from Hs as did from Hx . 

Points of Inflexion. Setting /c = 0 in (4) leads to^ 


pff 
A n+l 


{n + 



Pn-l. 


This shows, under the assumptions of Theorems Hi to inclusive, that the 
points of inflexion of Pn+i are at the zeros of Pn^i • 

Hermite Polynomials. Theorem Hi and statement immediately above con- 
cerning points of inflexion apply directly to Hermite Polynomials where N = 
and D ^ 


Lehigh Univebsity. 



THE SIMULTANEOUS COMPUTATION OF GROUPS OF REGRESSION 
EQUATIONS AND ASSOCIATED MULTIPLE CORRELATION 
COEFFICIENTS 

I ' , 

By Paul S; Dwybk 

1 . Introduction, The need sometimes arises for the prediction of a number of 
different variables from a given group of so-called fundamental variables. In 
the work of college prediction, for example, one might desire regression equations 
predicting certain measures of college achievement (e.g., first semester average, 
first semester English grade, first semester mathematics grade, number of hours 
of A received during first semester, etc.) on the basis of a number of other factors 
(e.g., high school record, score on American Council on Education Psychological 
Examination, score on some standard English achievement test, score on some 
standard mathematics achievement test, etc.). It is the purpose of this paper 
to show how the regression coefficients and the associated multiple correlation 
coefficients can be obtained simultaneously. The essence of the method is a 

, simple device by which one solution of general normal equations may be made to 
serve for all cases. 

2. The normal equations. Let Xi, Xi, xs, - Xn, be the so-called funda- 
mental variables and let Xn, be the predicted yariable. The normal equations 
are computed by standard methods which result in one of the three types. 

Type I. ' Normal equations for determining 6o , i>i , 62 1 63 , • ■ ■ , &n . 


bofi -f- biSaii -f- bi^Xi -p bjSiCa -p -p bn^Xn — Ssj = 0 

boUxi -p biUxi -p biSxiXi -p b^SxiXi -p -P bnSxiXn — SxiXt = 0 

hoZxi -p biSxiXs -p -p bsSxiXs -p -p bnSxiXn — ^XiXk = 0 


ba^Xn -p biZXnXl "P biSXnXt -p bs^XnXi “p ■ . "P bn^X^ — = 0 

Type II. Normal equations for determining bi , b2 , 63 , • • • , . 

Xi = X( — Jtf*,. 

+ biXxiXi -p biSxiXs -p -p bftSxiXn — S£iXk = 0 

biZXkXi -P b2SX2 -p bilXtXi -p -p bnXXiSn - = 0 

bi2x„Xi -p biSXnXi + bsSXnXi -P -p bn^xl — SXnX/e = 0 
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Type III. Normal equations for determining , • • • , jSn . 

01 + ri202 + n303 + rin0n - fife = 0 

^2101 +02 + r2303 + + r2n0rt - = 0 


TnlPl + ?'n202 + r„303 + + rnn0n — Tnfc = 0 

The three types are special cases of the general 

duVi + di2y2 + disj/i + + dij-yj + + duyn — dik = 0 

daiyi + d22y2 + d2^yz + + d2jyj + + dznyn — (hk — 0 

dnyi + d^2y2 + dss^/a + . .^. + + + d^yn — dsk = 0 

diiVi + di2y2 + dizys + + dijyj- + + — dik = 0 


dnlVl + dn2y2 j]r dn^y^ + + dnj2/; + + dnnVn — dnfc = 0 

where are the regression coefficients and dij = dji . 

The methods described in this paper are applicable to the general case and 
hence to each of the three particular types. 

In examining the normal equations, it is noticed that the first n terms of each 
equation are completely determined by the n fundamental variables. The 
equations, aside from the last terms, are identical no matter what variable is 
predicted. It is only necessary to devise a technique for separating the con- 
tributions of the dik terms. 

3. Solution by determinants. One method utilizes determinants. The 
value 2/i is expressed in terms of a determinant involving a column with entries 
dik i d2kj dzki ’ • • ,dnk- The determinant is expanded in terms of this column. 

Specifically, let D be the determinant of the coefficients of the yj and let Da 
be the cofactor of any element of D. Then 

n 

D = Dii dij 
and 

^ (Dll dlh + D 2 I d2k + Dfli dzk +....+ Dj'i dj'lc +....+ Dnl dnk •) 

y2 ^ (Di 2 dik + D 22 d2k + D 32 dsfc + 1 ■ • • + D]2 dj* + • • ■ • + Dn2 dnk •) 


2/^ s= i {Diidik + D?<d2jk + Diidzk + ^ ♦ • • + T^iidjk +•■••+ T)i\idnk) 


i (Din dlA + D2n + Dsn dzk +••••+ n djh +,..■+ Dnn dr^ •) 
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It is only necessary to compute — to find the coefficient of djk in the expansion 
of 2/<. 

An illustration is given. The normal equations are 

i8i + .3300 (Sa + .2100 /I3 - ru- = 0 

.3300 ft + /Sa - .4800 ft - rat = 0 

.2100 ft — .4800 ft + ft — rai = 0 

from which at once 


~ (.7696 m - .4308 rai - .3684 rai) 

02 = ^ (— .4308ru + .9559 raj: + .5493 rat) 

ft = i (-.3684rn + .54937-2* + .8911 ra*) 
and also ^ 

D = .550072 = (L00)(.7696) + (.33)(-.4308) + (.21)(-.3684) 

^ = (.33) (--.4308) + ( 1.00)(.9559) + (-.48)(.5493) 
= (.21)(-.3684) + (-.48)(.5493) + ( 1.00)(.8911) 

so that 

/3i = 1.3991 rifc - .7832 - .6697 . 

/?2 ^ —.7832 Tik “h 1.7378 v^k "1“ .9986 r^k - 


= —.6697 rik + .9986 r^k + 1.6200 nk . 

It is only necessary to insert any given values r^k , T^k , Tzk , to obtain the coeffi- 
cients of any specific regression equation. 


4 . Solutions without determinants. Theoretically the solution by deter- 
minants is excellent but as the number of variables increases the work of com- 
puting the cofactors or the ^ - different cofactorsl becomes enormous. 


We desire a technique for separating the contributions of the last terms when 
determinants are not used. This can be accomplished by using a separate 
column for each dik . Before algebraic manipulation, the value da is factored 
from the column and, after manipulative solution is complete, the multiplication 
by dt75. is carried out. 
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As an example consider the normal equations 

^1 4 “ ^ 12^2 — = 0 

ri/321 + J02 ~ = 0 

where ria = r 2 i = ,3300. Then the normal equations may be represented by 
rows (1) and (2) of Table I. 


TABLE I 


Row 

Operation 

iSl 

^2 

rik 

r2jt 

(1) 


1.0000 

.3300 

-1.0000 


(2) 


.3300 

1,0000 


-1.0000 

(3) 

— . 3300 times (2) 

- .1089 

- .3300 


.3300 

(4) 

(1) + (3) 

.8911 


-1.0000 

.3300 

(5) 

— (4) divided by .8911 

-l.OOOOi 


1.1222 

- .3703 

(6) 

- .3300 times (5) 

.,3300 


- .3703 

.1222 

(7) 

“■ (2) + (6) 


-1.0000 

- .3703 

1 . 1222 


The four decimal place solution, whose steps are indicated by (3) (4) (5) (6) (7), 
is from (5) and (7) 

ft = 1.1222 ru - .3703 r2k 
ft == -.3703 Tik + 1.1222 r2k 

This device may be combinea with most of the standard methods of solving 
normal equations. 

5. Combination with Doolittle method. Especially to be recommended is a 
combination of this device with the Doolittle method which is recognized as a 
most efficient method of solving normal equations in from five to ten variables 
[1] [2]. One of the advantages of the Doolittle method is that related multiple 
regression coefficients may be obtained from the same forward solution, though 
additional back solutions are necessary [3]. 

The problem which led to the development of this technique was the simul- 
taneous prediction of scores on various occupations covered by the Strong 
Vocational Interest Blank from the scores on a few fundamental occupations. 
A multiple factor analysis revealed that five basic factors account for most of the 
scores. Five occupational scores, serving as approximations to the five basic 
factors, were used as the fundamental variables and the other scores were 
predicted from them. 

As an illustration of this prediction technique combined with the Doolittle 
method, I have selected three test scores as fundamental since the solution based 
on them shows all the steps of the Doolittle method and is shorter than the five 



228 


PAUL S. DWYER 


variable problem. Actually, solution by determinants (section 3) is advised 
for problems involving three variables. The steps of the Doolittle solution are 
presented in Table II. The results should be compared with those of the 
determinant solution of section 3. 

The first column indicates the row and the second the description of the 
algebraic operation. The next three columns are the standard columns of a 
Doolittle presentation with the conventional elimination of the lower left entries. 
The next three columns carry through the Doolittle method with the values 
Tik f T 2 k , Tu kept in separate columns. The last column is an adaptation of the 
conventional summary check column of the Doolittle solution. 

TABLE II 


Generalized Doolittle Presentation 


Row 

OpetaUon 

Pi 1 

pi 

Pi 

rife 

rife 

TSfe 

S 

(1) 



.3300 

.2100 

-1.0000 




(2) 


.3300 

1.0000 

-.4800 





(3) 


.2100 

-.4800 

1.0000 





(4) 

Repeat (1) 


■1 

.2100 

-1.0000 




(5) 

Negative of (4) 

HQ 


-.2100 

1.0000 




(6) 

Repeat (2) 



-.4800 





(7) 



-.1089 

-.0693 




-.1782 

(8) 

(6) + (7) 


.8911 

-.5493 

.3300 

-1.0000 


-.3282 

(9) 

— (8) divided by 



.6164 

-.3703 

1.1222 


.3683 


.8911 








(10) 

Repeat (3) 








(11) 

— .2100 times (4) 







-.1134 

(12) 

.6164 times (8) 



-.3386 


- .6164 


-.2023 

(13) 

(10) + (11) + (12) 



.6173 

.4134 

-.6164 


- .5857 

(14) 

— (13) divided by 




-.6697 

.9985 

1,6200 

.9488 


.6173 








(15) 

.6164 times (14) 



-.6164 

-.4128 

.6165 

,9986 

.5848 

(16) 

(9) + (16) 




-.7831 

1.7377 

.9986 

.9531 

(17) 






-.mi 

-.3402 

-.1992 

(18) 



.3300 


.2584 

-.573Jt 

-.3295 

-.3146 

(19) 

(6) + (17) + (18) 





-.7831 

-.6697 

-1.0537 


The general solution is read from rows (19) (16) (14) and is 

jSi = 1.3990 - .7831 m - .6697 rs* . 

|3s = -.7831 ru -f 1.7377.r2i + .9986 r,* . 

^3 = -.6697 rii + .9985 rzk + 1.6200 . 
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which agrees, aside from the last place, with the result of the solution by de- 
terminants. 

It is wise to check in the original equations (1), (2), (3) as soon as any is 
found. Row (14), for example, should be checked by showing 

(-.6697) (1.00) -f (.9985)( .33) -f (1.6200)( .21) = .0000 

(-.6697)( .33) + (.9985)( 1.00) 4- (1.6200) (-.48) = -.0001 

(-.6697)( .21) + (.9985)(-.48) -|- (1.6200)( 1.00) = 1,0001 

The same should be done with row (16) as soon as it is computed. Row (19) 
should be treated similarly. 

6. Many regression equations. If large numbers of regression equations are 
to be generated (the Strong Vocational Interest Study had 29 dependent va- 
riables), the following technique is suggested. Make a table with columns 
rik , rik , etc. and use the rows to indicate the different values of k. On another 
slip of. paper insert the general values , § 2 , ^ 3 , • • • in successive rows so 
that a folding of the paper will bring any general ^ expansion in conjunction 
with the r’s of any test, k. The scheme is illustrated in Table III. 


TABLE III 


No. 

Occupation 

' rile 


rsjfc 


ySifc 




r 

1 

Teacher 

1.00 

.33 

.21 


1.00 

.00 

.00 


1.00 

■2 

Physicist 

.33 

1.00 

-.48 


.00 

1.00 

.00 


1.00 

3 

Office Worker 

.21 

-.48 

^ 1.00 


.00 

,00 

1.00 


1.00 

• 4 

Doctor 

.17 

.79 

-.52 


-.03 

.72 

-.17 


.81 

5 

Lawyer 

-.02 

.16 

-.59 


.24 

-.30 

-.78 


.64 

■' 6 

Engineer 

.16 

.78 

-.02 


1 

CO 

1,21 

.64 


.93 







t 








-.7831 

-.6697 



t 






-.7831 

1.7377 

.9986 




t 




Pi 

-.6697 

.9986 








m 

Mathematician 

.46 

.96 

-.49 


.19 

.82 

-.14 


.97 


etc. 











Thus, for the occupation of Engineer, 

jSi = 1.3990 (.16) -b (-.7831)(.78) 4- (-.6697) (-.02) = - .37 

|32 = -.7831 (.16) 4 - ( 1.7377) (.78) 4 - ( .9996) (-.02) = 1.21 

ft = -.6697 (.16) 4 - ( .9985) (.78) 4 - ( 1.6200)(-.02) = .64 
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The value of the multiple correlation coefficient is then computed from the 
formula 

rjfc.123 n = 'X/ PlkTik + 02kT2k +•■••+ PnkTnk 

In the illustration above 

r,.i23 = V(-.37)(.16) + (L21)(,78) + (.64)(-.02) 

= .93 

7. Regression equations by deletion. The method of getting related regres- 
sion coefficients and correlation coefficients, described by Kurtz [3], is also 
applicable. Again, a problem involving more than three variables is needed to 
show the real value of the scheme but the technique may be illustrated in the 
three variable case. We wish to find, from the forward solution of Table II, 
the regression equation and the multiple correlation coefficient when the first two 
fundamental variables only are used. We delete all columns involving test 3 
and complete the back solution as indicated in Table IV, which may be viewed 
as a substitute for the last ten rows of Table II. 


TABLE IV 
(See Table II) 


Row 

Operation 

Pi 

02 

03 

rih r 

Tik 

r 3 k 

8 

(20) 

(21) 

(22) 

, Repeat (9) 

— .3300 times (20) ' 

(6) + (21) 

-1.0000 

- 1,0000 

.3300I 


-.3703 

.1222 

1.1222 

1.1222 

-.3703 

-.3703 




The results are 

/3i = 1.1222 ru - .3703 rajt. 

P 2 = -.3703 ru + 1.1222 rzfc. 
and these agree with the results of section 4. 

8. The simplified back solution. In every case in which the /5’s have been 
given in terms of r's the matrix of the coefficients is symmetric (sections 3, 4, 5, 7). 
One wonders if this sjonmetry is generally true and if it holds for normal equa- 
tions of Type I or Type II. 

Determinants are much more useful in establishing general properties, such 
as the one under discussion, than they are in computing the values of regression 
coefficients in the case of a problem involving many variables, r; We return to the 
determinant notation of section 3, 

In each of the three types, and hence in the general case dif = da so that D is a 
symmetric determinant, = D,,^ and ^ ^ • Hence the matrix of the 

coefficients of the solution is symmetric. 
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This result may be used (1) to check the expanded results or (2) to eliminate 
some of the work of the back solution. The n coefficients must be recorded for 
pn after which the column indicated by Tnk may be dropped. The first n - 1 
coefficients must be computed for Pn-i after which the column indicated by 
rn-i,k may be dropped, etc. The italicized entries in Table II are the ones 
which are eliminated in this way. The remaining coefficients are sufficient to 
completely determine the symmetric matrix. 

The summary right hand check column can not be readily used in the simpli- 
fied back solution but it is hardly to be^recommended anyway, Kurtz [3] 
argues against it on the ground that it is not necessary. The essential check is 
to see that each p solution satisfies all of the original equations. 

9. Conclusion. This paper provides a technique for the computation of 
general regression equations and shows how the technique may be combined 
with the Doolittle method in providing a practical means of mass prediction. 

University of Michigan. 
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CONSTITUTION 


AtlTIOLE I, 

NAME AND PURPOSE 

1. This organization shall be known as the Institute of Mathematical Sta- 
tistics, 

2. Its object shall be to promote the interests of mathematical statistics. 

Ahticle II 
MEMBERSHIP 

1. The membership of the Institute shall consist of Members, Fellows, 
Honorary Members, and Sustaining Members. 

2. Fellows shall be the only voting members of the Institute. 

Article III 

OFFICERS, BOARD OF DIRECTORS, COMMITTEE ON MEMBERSHIP, 
AND COMMITTEE ON PUBLICATIONS 

1. The Officers of the Institute shall be a President, two Vice-Presidents, 
and a Secretary-Treasurer, elected for a term of one year by a majority ballot 
at the annual meeting of the Institute. Voting may be in person or by mail. 

(a) Exception. The first group of Officers shall be elected by a majority 
vote of the individuals present at the organization meeting, and shall serve until 
December 31, 1936. ' 

2. The Board of Directors of the Institute shall consist of the Officers and 
the previous President. 

3. The Institute shall have a Committee on Membership composed of three 
Fellows. At their first meeting subsequent to the adoption of this Constitution, 
the Board of Directors shall elect three members as Fellows to serve as the 
Committee on Membership, one member of the Committee for a term of one 
year, another for a term of two years, and another for a term of three years. 
Thereafter the Board of Directors shall elect from among the Fellows one 
member annually at their first meeting after their election for a term of three 
years. The president shall designate one of the Vice-Presidents as Chairman 
of this Committee. 

4. The Institute shall have a Committee on Publications composed of three 
Members or Fellows elected by the Board of Directors. The President shall 
designate a Vice-President as Ex Officio Chairman of this Committee. 
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Artioub IV 

MEETINGS 

1. A meeting for the presentation and discussion of papers, for the election of 
Officers, and for the transaction of other business of the Institute shall be held 
annually at such time as the Board of Directors may designate. Additional 
meetings may be called from time to time by the Board of Directors and shall be 
called at any time by the President upon written request from ten Fellows. 
Notice of the time and place of meeting shall be given to the membership by the 
Secretary-Treasurer at least thirty days prior to the date set for the meeting. 
All meetings except executive sessions shall be open to the public. Only 
papers accepted by a Program Committee appointed by the President may be 
presented to the Institute. 

2. The Board of Directors shall hold a meeting immediately after their 
election and again immediately before the expiration of their term. Other 
meetings of the Board may be held from time to time at the call of the President 
or any two members of the Board. Notice of each meeting of the Board, other 
than the two regular meetings, together with a statement of the business to be 
brought before the meeting, must be given to the members of the Board by the 
Secretary-Treasurer at least five days prior to the date set therefor. . Should 
other business be passed upon, any member of the Board shall have the right to 
reopen the question at the next meeting. 

3. The Committee on Membership shall hold a meeting immediately after the 
annual meeting of the Institute. Further meetings of the Committee may be 
held from time to time at the call of the Chairman or any member of the Com- 
mittee provided notice of such call and the purpose of the meeting is given to 
the members of the Committee by the Secretary-Treasurer at least five days 
before the date set therefor. Should other business be passed upon, any 
member of the Committee shall have the right to reopen the question at the 
next meeting. 

4. At a regularly convened meeting of the Board of Directors, three members 
shall constitute a quorum. At a regularly convened meeting of the Committee 
on Membership, two members shall constitute a quorum. 

Article V 
PUBLICATIONS 

1. In the beginning, the “Annals of Mathematical Statistics'^ shall serve as 
the official journal for the Institute.^ Other publications may be originated 
by the Board of Directors as occasion arises. 

Article VI 

EXPULSION OR SUSPENSION 

1. Except for non-payment of dues, no one shall be expelled or suspended 
except by action of the Board of Directors with not more than one negative vote. 
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Article VII 
AMENDMENTS 

1. This constitution may be amended by an affirmative two-thirds vote at 
any regularly convened meeting of the Institute provided notice of such proposed 
amendment shall have been sent to each Fellow by the Secretary-Treasurer at 
least thirty days before the date of the meeting at which the proposal is to be 
acted upon. Voting may be in person or by mail. 

BY-LAWS 

Article I 

DUTIES OF THE OFFICERS, BOARD OF DIRECTORS, COMMITTEE. 
ON MEMBERSHIP, AND COMMITTEE ON PUBLICATIONS 

1. The President, or in Ms absence, one of the Vice-Presidents, or in the 
absence of the President and both Vice-Presidents, a Fellow selected by vote 
of the Fellows present, shall preside at the meetings of the Institute and of the 
Board of Directors. At meetings of the Institute, the presiding officer shall 
vote only in the case of a tie, but at meetings of the Board of Directors he may 
vote in all cases. At least three months before the date of the annual meeting, 
the President shall appoint a Nominating Committee of three members. It 
shall be the duty of the Nominating Committee to make nominations for 
Officers to be elected at the annual meeting and the Secretary-Treasurer shall 
notify all Fellows at least thirty days before the annual meeting. Additional 
nominations may be submitted in writing, if signed by at least ten Fellows of 
the Institute, up to the time of the meeting. 

2. The Secretary-Treasurer shall keep a full and accurate record of the 
proceedings at the meetings of the Institute and of the Board of Directors, 
send out calls for said meetings and, with the approval of the President and the 
Board, carry on the correspondence of the Institute. Subject to the direction 
of the Board, he shall have charge of the archives and other tangible and 
intangible property of the Institute. He shall send out calls for annual dues and 
acknowledge receipt of same; pay all bills approved by the President for expendi- 
tures authorized by the Board or the Institute; keep a detailed account of all 
receipts and expenditures, prepare a financial statement at the end of each year 
and present an abstract of the same at the annual meeting of the Institute after 
it has been audited by a Member or Fellow of the Institute appointed by the 
President as Auditor! The Auditor shall report to the President. 

3. The Board of Directors shall have charge of the funds and of the affairs 
of the Institute, with the exception of those affairs specifically assigned to the 
President or to the Committee on Membership. The Board shall have au- 
thority to fill all vacancies ad interim, occurring among the Officers, Board of 
Directors, or in any of the Committees. The Board may appoint such other 
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committees as may be required from time to time to carry on the affairs of the 
Institute. 

4. The Committee on Membership shall prepare and make available through 
the Secretary-Treasurer an announcement indicating the qualifications requisite 
for ^he difierent grades of membership. 

5, The Committee on Publications, under the, general supervision of the 
Board of Directors, shall have charge of all matters connected with the publica- 
tions of the Institute, and of all books, pamphlets, manuscripts and other 
literary or scientific material collected by the Institute. Once a year this 
Committee shall cause to be printed in the Official Journal the Constitution 
and By-Laws and a classified list of all the Members and Fellows of the Institute. 

Articije II 
DUES 

1. Members shall pay five dollars at the time of admission to membership 
and shall receive the full current volume of the Official Journal. Thereafter, 
Members shall pay five dollars annual dues. The annual dues of Fellows shall 
be five dollars. The annual dues of Sustaining Members shall be fifty dollars. 
Honorary Members shall be exempt from all dues. 

2. Annual dues shall be payable on the first day of January of each year. 

3. The annual dues of a Fellow or Member include a subscription to the 
Official Journal. The annual dues of a Sustaining Member include two sub- 
scriptions to the Official Journal. 

4. It shall be the duty of the Secretary-Treasurer to notify by mail anyone 
whose dues may be six months in arrears, and to accompany such notice by a 
copy of this Article. If such person fail to pay such dues within three months 
from the date of mailing such notice, the Secretary-Treasurer shall report the 
delinquent one to the Board of Directors, by whom the personas name may be 
stricken frona the rolls and all privileges of membership withdrawn. Such 
person may, however, be re-instated by the Board of Directors upon payment 
of the arrears of dues. 


Article III 


SALARIES 

1. The Institute shall not pay a salary to any Officer, Director, or member of 
any committee. 

Article IV 
AMENDMENTS 

1. These By-Laws may be amended in the same manner as the Constitution 
or by a majority vote at any regularly convened meeting of the Institute, if the 
proposed amendment has been previously approved by the Board of Directors. 
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